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OF STEROID COMPOUNDS 



5 TECHNICAL FIELD 

The present invention relates to biotechnology 
with an emphasis on plant biotechnology, and 
particularly biotechnology affecting the biosynthesis 
of steroid compounds . 

10 

BACKGROUND 

Enhancement of the nutritional or health benefits 
of oils through genetic engineering is being addressed 
throughout the agricultural community. Several 

15 approaches involve manipulation of already present 

cellular biosynthetic pathways. Steroid biosynthetic 
pathways are of current interest, particularly for the 
enhancement of health benefits from food oils. 

Several related U.S. patents address increasing 

20 sterol accumulation in higher plants. Those patents 
include U.S. Patent No. 5,589,619 "Process and 
Composition for increasing squalene and sterol 
accumulation in higher plants" (accumulation of 
squalene in transgenic plants by increasing HMGR 

25 activity) and U.S. Patent No. 5,306,862 "Method and 
composition for increasing sterol accumulation in 
higher plants" (increasing HMGR activity to increase 
plant sterol accumulation- -including sterol and 
cycloartenol , which affects insect resistance- - in 

30 tobacco, tomato, corn, carrot, soybean, cotton, barley, 
arabidopsis, guayule and petunia; seeds with elevated 
sterol/cycloartenol, 7S promoter and CaMV promoters), 
U.S. Patent No. 5,365,017 "Method and composition for 
increasing sterol accumulation in higher plants" (DNA 
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construct with HMGR-CaMV 35S, transgenic plants, hybrid 
plants, corn, soy, barley, tomato, Arabidopsis) , U.S. 
Patent No. 5,349,12 6 "Process and composition for 
increasing squalene and sterol accumulation in higher 
plants" (increase in squalene and sterol accumulation 
by increasing HMGR activity in transgenic tobacco, 
cotton, soybean, tomato, alfalfa, Arabidopsis , corn, 
barley, carrot and guayule plants) , and EP 486290 
(enhancement of squalene and specific sterol . [squalene 
zymosterol, cholest -7 , 24 -dienol , cholest -5 , 7 , 24 - 
trienol] accumulation in yeast by increasing HMGR 
activity in yeast deficient in enzymes that convert 
squalene to ergosterol) . 

In those patents, the amount of a protein 
exhibiting 3 -hydroxy-3 -methylglutaryl Coenzyme-A 
reductase (HMGR) activity is typically increased. HMGR 
widens a "bottleneck" near the beginning of a 
biosynthetic path to steroid production, permitting a 
higher carbon flux through steroid biosynthetic 
pathways and resulting in increased sterol 
accumulation. 

U.S. Patent No. 5,480,805 "Composition for 
modulating sterols in yeast" (enhancement of delta 8-7 
isomerase activity-ERG2 enhances accumulation of 
specific sterols in yeast) . 

U.S. 5,460,949 "Method and composition for 
increasing the accumulation of squalene and specific 
sterols in yeast" (increasing squalene, zymosterol and 
specific sterols in yeast by increasing HMGR in yeast 
having decreased erg5 and erg6 activity- -Sc and hamster 
HMGR) . 

WO 9845457 (SMTI, Erg6 from A. t., corn, yeast; 
transgenic plants with altered sterol level s_using DNA 
encoding an enzyme binding a first sterol and producing 
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a second sterol - -altered carotenoid, tocopherol, 
modified FA levels- -HMGR, 5a-reductase, geranylgeranyl 
pyrophosphate synthase, phytoene synthase, phytoene 
desaturase, isopentenyl diphosphate isomerase) . 

Acetate is the metabolic precursor of a vast array 
of compounds vital for cell and organism viability. 
Acetyl coenzyme A (CoA) reacts with acetoacetyl CoA to 
form 3-hydroxy-3 methylglutaryl CoA (HMG-CoA) . HMG-CoA 
is reduced to mevalonate in an irreversible reaction 
catalyzed by the enzyme HMG-CoA reductase. Mevalonate 
is phosphorylated and decarboxylated to isopentenyl - 
pyrophosphate (IPP) . Through the sequential steps of 
isomerization, condensation and dehydrogenat ion, IPP is 
converted to geranyl pyrophosphate (GPP) . GPP combines 
with IPP to form farnesyl pyrophosphate (FPP) , two 
molecules of which are reductively condensed to form 
squalene, a 3 0 -carbon precursor of sterols. 

A key enzyme in sterol biosynthesis is 3 -hydroxy - 
3 -methylglutaryl -Coenzyme A reductase (HMG-CoA 
reductase or HMGR). Schaller et al . (Plant Physiol. 
109: 761-770, 1995) found that over-expression of 
rubber HMGR {hmgl) genomic DNA in tobacco leads to the 
overproduction of sterol end-products (sitosterol, 
campesterol and stigmasterol) up to 6-fold in leaves. 
Further, the excess sterol was stored as steryl -esters 
in lipid bodies. HMGR activity was increased by 4- to 
8-fold. 

Sterols are derivatives of a fused, reduced ring 
system, cyclopenta- [a] -phenanthrene, comprising three 
fused cyclohexane rings (A, B, and C) in a phenanthrene 
arrangement, and a terminal cyclopentane ring (D) 
having the formula (I) and carbon atom position 
numbering shown below: 
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where R is an 8 to 10 carbon-atom side chain. 

In plants, squalene is converted to squalene 
epoxide, which is then cyclized to form cycloartenol 
(4,4, 14a-trimethyl-9p, 19-cyclo-5a-cholest-24-en-3p-ol) . 
Cycloartenol has two methyl groups at position 4, a 
methyl group at position 14, a methylene bridge between 
the carbon atoms at positions 9 and 19 that forms a 
disubstituted cyclopropyl group at those positions, and 
includes an 8 -carbon sidechain of the formula: 
CH 3 CH(CH 2 ) 2CH=C (CH 3 ) 2 • Squalene epoxide can 
alternatively be converted into pentacyclic sterols, 
containing five instead of four rings. Exemplary 
pentacyclic sterols include the phytoalexins and 
saponins . 

Being one of the first sterols in the higher plant 
biosynthetic pathway, cycloartenol serves as a 
precursor for the production of numerous other sterols. 
In normal plants, cycloartenol is converted to 
predominantly 24 -methylene cycloartenol (4,4,14a- 
dimethyl-9p, 19-cyclo-22 , 23 -dihydro-ergosta-24 (28) -en-3- 
P-ol) , cycloeucalenol, (4 , 14a-trimethyl -9p, 19 cyclo-5a- 
ergosta-24 (28) -en-3p-ol) , isof ucosterol (5a-stigmasta- 
5-24 (28) -dien-3P-ol) , sitosterol (5a-stigmasta-5-en-3p- 
ol) , stigmasterol- (stigmasta-5 , -22 -dien-3P-ol ) , 
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campesterol (5a-ergosta-5-en-3p-ol) , and cholesterol 
(5a-cholesta-5-en-3P-ol) . These transformations are 
illustrated in Fig. 1. 

Although sterols produced by plants, and 
particularly higher (vascular) plants, can be grouped 
by the presence or absence of one or more of several 
functionalities, plant sterols are classified into two 
general groups herein; i.e., those containing a double 
bond between the carbon atoms at positions 5 and 6 
(delta-5 or A5 sterols) and those not containing a 
double bond between the carbon atoms at positions 5 and 
6 (non-delta-5 sterols) . 

Exemplary naturally-occurring delta-5 plant 
sterols are isof ucosterol , sitosterol, st igmasterol , 
campesterol, cholesterol, and dihydrobrassicasterol . 
Exemplary naturally occurring non-delta-5 plant sterols 
are cycloartenol , 24 -methylene cycloartenol , 
cycloeucalenol, and obtusif oliol . The most abundant 
sterols of vascular plants are campesterol, sitosterol, 
and stigmasterol , all of which contain a double bond 
between the carbon atoms at positions 5 and 6 are 
classified as delta-5 sterols. 

The HMG-CoA reductase enzymes of animals and 
yeasts are integral membrane glycoproteins of the 
endoplasmic reticulum. The intact enzyme comprises 
three regions: a catalytic region containing the active 
site of the enzyme; a membrane binding region anchoring 
the enzyme to the endoplasmic reticulum; and a linker 
region joining the catalytic and membrane binding 
regions of the enzymes . The membrane binding region 
occupies the amino- terminal (N- terminal) portion of the 
intact protein, whereas the catalytic region occupies 
the carboxy-terminal (C- terminal) portion of the 



6 MTC 6783.1 

PATENT 

protein, with the linker region constituting the 
remaining portion. M.E. Basson et al . , Mol . Cell 
Biol., 8 (9) :3797-3808 (1988). 

The activity of HMG-CoA reductase in animals and 
yeasts is known to be subject to feedback inhibition by 
sterols. Such feedback inhibition requires the 
presence of the membrane binding region of the enzyme. 
See, e.g., G. Gil et al. Cell, 41:249-258 (1985); M . 
Bard and J.F. Downing, J". Gen. Microbiol., 124:415-420 
(1981) . 

Given that mevalonate is the precursor for sterols 
and other isoprenoids, it might be expected that 
increases in the amount or activity of HMG-CoA 
reductase would lead to increases in the accumulation 
of both sterols and other isoprenoids. 

In mutant strains of the yeast Saccharomyces 
cerevisiae (S. cerevisiae) having abnormally high 
levels of HMG-CoA reductase activity, the production of 
two sterols, 4 , 14 -dimethyl zymosterol and 14- 
methylfucosterol is markedly increased above normal. 
Downing, et al . , Biochem. Biophys . Res. Comm., 94(3): 
874-979 (1980) . 

When HMG-CoA reductase activity was increased by 
illumination in non-photosynthetic microorganisms, 
isoprenoid (carotenoid) , but not sterol (ergosterol) , 
synthesis was enhanced. Tada, et al . , Plant and Cell 
Physiology, 23 (4 ) : 615- 62 1 (1982). 

WO 9703202 discloses a method for identifying 
agents modulating sterol biosynthesis using a yeast 
acetoacetyl CoA thiolase (ERG10) gene linked to a 
reporter system to evaluate compounds, such as 
lovastatin and other HMG-CoA synthase inhibitors, that 
affect cholesterol biosynthesis. 
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U.S. Patent No. 5,668,001 teaches a recombinant 
avian HMG-CoA synthase preparation useful for 
evaluating drugs that inhibit cholesterol biosythesis. 

JP 09121863 discloses a plant with increased 3- 
hydroxy- 3 -methyl glutaryl coenzyme A reductase (HMGR) 
activity as a result of increasing the expression of a 
mutant protein kinase gene that regulates expression of 
the HMGR gene. The increased HMGR activity increases 
squalene, zymosterol, cholesta-7 , 24-dienol and cholest- 
5 , 7 , 24-trenol accumulation in yeast with ERG5 and ERG6 
mutants . 

EP 480730 "Plant-sterol accumulation and pest 
resistance-by increasing copy number of 3-hydroxy-3- 
methyl glutaryl coenzyme -A reductase gene in tobacco, 
tomato and corn 

WO 9913086 "Human Delta 7-sterol reductase 
polypeptide-usef ul for diagnosis or treatment of 
genetic defects e.g. hereditary Smith-Lemli-Opitz 
syndrome" teaches making and using the recombinant 
polypeptide with humans. 

Chappell et al . U.S. Patent No. 5,589,619 teaches 
that transformation of higher plants with truncated 
HMG-CoA reductase enhanced the production of squalene, 
cycloartenol and certain sterols, particularly 
compounds having unsaturations at the 5 -position. 
Several intermediate sterols as are shown in Fig. 1 
were also produced. It would be beneficial if the 
production of sitosterol and stigmasterol could be 
enhanced while lessening the accumulation of the 
intermediate sterols. The present invention provides 
avenues for enhancing production of sitosterol and 
stigmasterol and lessening the accumulation of the 
intermediate sterols. 
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Gonzalez et al . (Abstract of poster at Third 
Terpnet Meeting of the European Network on Plant 
Isoprenoids, May 29-30, 1997, Poitiers, France) over- 
expressed the Arabidopsis HMGR cDNA {hmgl and hmg2) and 
5 found sterol overproduction with hmgl only. They used 
two forms of the hmgl gene, a full-length form and a 
truncated form containing only the catalytic domain. 
HMGRs have three domains, an N- terminal membrane 
spanning domain, a short linker domain, and a C- 

10 terminal catalytic domain. In this case the transgenic 
plants were also Arabidopsis . The difference between 
the full-length and truncated forms was a greater 
accumulation of pathway intermediates in the case of 
the truncated form. More importantly, the intermediates 

15 demonstrated as accumulating were cycloartenol , 24- 
methylenecycloartanol and obtusif oliol . 

Finally, US patents 5,365,017 and 5,306,862, both 
assigned to Amoco Corp., disclose a method for 
increasing sterol accumulation in plants by increasing 

20 the copy number of a gene having HMG-CoA reductase 
activity. These inventions disclose a method using 
hamster truncated HMGR that consisted of the catalytic 
domain and the linker domain. According to the claims 
the linker domain was essential for activity. They also 

25 demonstrated a greater accumulation of pathway 
intermediates such as cycloartenol. 

BRIEF SUMMARY 

The present invention relates to transgenic plants 
30 and their progeny having improved nutritional 

characteristics. More particularly, the present 
invention relates to transgenic plants and their 
progeny, the storage organs (e.g. seed, fruit and 
vegetable parts) of which contain modified levels of 
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steroid compounds, such as (i) elevated levels of 
beneficial phytosterols (e.g., sitosterol), 
phytostanols (e.g., sitostanol) , and esters thereof, 
relative to an otherwise identical plant transformed 
only with a truncated HMG-CoA reductase gene or a wild- 
type plant, and (ii) reduced levels of steroid pathway 
intermediate compounds (e.g. one or more of squalene, 
cycloartenol , 24 -methylene cycloartenol , obtusif oliol , 
stigmasta-7-enol and campesterol) in their storage 
organs relative to an otherwise identical transgenic 
plant transformed only with a truncated HMG-CoA 
reductase gene. Nucleic acid sequences encoding 
enzymes that affect the biosynthesis and accumulation 
of steroid compounds in plants (HMG-CoA reductase and a 
steroid pathway enzyme) , and methods for using these 
sequences to produce such transgenic plants, are also 
provided. These methods comprise, for example, 
introducing into cells nucleic acid sequences encoding 
enzymes that affect the levels of accumulated steroid 
pathway end products . 

The present invention contemplates a recombinant 
construct or a recombinant vector that contains 2 DNA 
sequences. The first encodes a polypeptide exhibiting 
3 -hydroxy- 3 -methylglutaryl - Coenzyme A (HMG-CoA) 
reductase activity. The second DNA sequence encodes a 
polypeptide exhibiting the activity of another steroid 
pathway enzyme. Each polypeptide-encoding DNA sequence 
is operably linked in the 5 ' to 3 1 direction to a 
promoter and a transcription termination signal 
sequence independent of the other sequence. The 
promoter is located upstream and the termination 
sequence downstream of each polypeptide-encoding DNA 
sequence. The second DNA sequence encoding a steroid 
pathway enzyme can code for a squalene epoxidase 
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enzyme, a sterol methyl transferase I enzyme, a sterol 
C4 -demethylase enzyme, a obtusifoliol C14oc-demethylase 
enzyme, a sterol C5 -desaturase enzyme, or a sterol 
methyl transferase II enzyme. It is contemplated that 
5 HMG-CoA reductase and the steroid pathway enzyme 

activity comes from a mutant or truncated form of those 
enzymes, such as a truncated HMG-CoA reductase lacking 
the transmembrane region while retaining a functional 
catalytic domain. Examples of such preferred HMG CoA 

10 reductases include the truncated rubber and Arabidopsis 
HMG CoA reductases disclosed herein. 

Preferably, the regulatory function of a promoter 
is substantially unaffected by cellular levels of 
squalene such as the CaMV 3 5S promoter. In one aspect, 

15 a promoter is seed- specif ic . In another aspect, a 

promoter is derived from a species in a different order 
from a host cell. In another aspect, the HMG-CoA 
reductase or steroid pathway enzymes is from a species 
in a different order from the order that of the host 

20 cell . The invention contemplates a construct or 

recombinant vector having more than one DNA sequence 
encoding a steroid pathway enzyme that do not have to 
be under the control of the same promoter. Preferably, 
a recombinant vector is a plant expression vector. 

25 In another aspect of the invention, a transformed 

host cell comprises a recombinant construct or vector 
as described above. Preferably, a host cell is plant 
cell, preferably that plant cell is from canola, 
soybean, corn, maize, tobacco, cotton, rape, tomato or 

30 alfalfa. The invention contemplates a host cell in a 
cell culture, plants derived from transformed host 
cells, and storage organs (seeds, fruits and vegetable 
parts) from transgenic plants. 
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In addition to contemplating transgenic plants and 
seeds, the invention contemplates transgenic plant 
seeds capable of germinating into a transgenic plant 
and mutants, recombinants, genetically engineered 
derivatives thereof and hybrids derived therefrom. The 
plant over-accumulates steroid pathway products 
relative to a native, non- transgenic plant of the same 
strain, wherein said mutants, recombinants, genetically 
engineered derivatives thereof and hybrids derived 
therefrom maintain the ability to overaccumulate 
steroid pathway products. 

The invention contemplates a process of increasing 
the formation of steroid pathway products in a 
transformed host cell as compared to an otherwise 
identical non- trans formed host cell. Contemplated 
processes use the described recombinant constructs and 
vectors to transform host cells, then growing the host 
cells or regenerating transgenic plants therefrom. 

In one aspect of the invention, the genome of a 
contemplated plant, its progeny, seeds or cell culture, 
comprises introduced DNA encoding an HMG-CoA reductase 
activity and introduced DNA encoding a steroid pathway 
enzyme that is a squalene epoxidase enzyme, a sterol 
methyl transferase I enzyme, a sterol C4 -demethylase 
enzyme, a obtusifoliol C14a-demethylase enzyme, a 
sterol C5-desaturase enzyme, or a sterol methyl 
transferase II enzyme. The storage organs of such a 
plant contain an elevated level of total accumulated 
sterol, compared to storage organs of an otherwise 
identical plant, the genome of which does not comprise 
said introduced DNA. Further, the storage organs of 
the plant contain a reduced level of squalene, 
cycloartenol , 24 -methylene cycloartenol , obtusifoliol, 
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stigmasta-7-enol , or campesterol compared to the seeds 
of an otherwise identical plant or a plant comprising 
an introduced DNA encoding an HMG-CoA reductase enzyme. 
The invention contemplates a method of producing a 
5 plant that accumulates an elevated level of sterol 
pathway products compared to a corresponding plant 
comprising no introduced DNA encoding a peptide, 
polypeptide, or protein that affects the biosynthesis 
and accumulation of a sterol pathway product, 

10 comprising sexually crossing plants to arrive at a 
plant comprising nucleic acid encoding an HMG CoA 
reductase and a steroid pathway enzyme, including 
crosses with a nurse cultivar. The plants, including 
apomicitic plants, uniform populations of the plants 

15 and their seeds and parts other than seeds are 
contemplated . 

Another aspect of the invention is oils containing 
at least one sterol pathway product, extracted from the 
seeds of a contemplated plant. Preferably sitosterol, 

20 at least one sitosterol ester, or mixtures thereof, 
comprise at least about 57% by weight of the total 
sterol compounds of a contemplated oil. Preferably 
sitosterol, that at least one sitosterol ester, or 
mixtures thereof, comprise at least about 0.08% of the 

25 dry weight of a contemplated seed. Preferably, the oil 
has a reduced amount of squalene, cycloartenol , 24- 
methylene cycloartenol, obtusif oliol , stigmasta- 7-enol , 
campesterol, or combinations thereof, compared to oil 
from a corresponding transgenic plant that does not 

30 contain introduced DNA encoding a squalene epoxidase 
enzyme, a sterol methyl transferase I enzyme, a sterol 
C4-demethylase enzyme, a obtusif oliol C14a-demethylase 
enzyme, a sterol C5 -desaturase enzyme, a sterol methyl 
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transferase II enzyme, or mixture thereof; wherein the 
reduction is in the range of from about 10% to about 
100%. 

Sitosterol ester compositions derived from 
5 transgenic plants of the present invention, their 
progeny or their seeds are also contemplated, 
preferably wherein an esterifying fatty acid has 2 to 
2 2 carbon atoms in the main chain. 

A further aspect of the invention is cholesterol- 
10 lowering compositions comprising contemplated oils and 
sitosterol ester compositions. Another further aspect 
of the invention is foods, food ingredients, or food 
compositions comprising contemplated oils. 

Still further, the invention contemplates 
15 pharmaceutical compositions comprising a cholesterol- 
lowering effective amount of a contemplated oil, and a 
pharmaceutical ly acceptable carrier, excipient, or 
diluent . 

A method of lowering the plasma concentration of 
20 low density lipoprotein cholesterol is contemplated, 
comprising orally administering to a human or animal 
subject an effective amount of an above composition. 
Also contemplated is a method of treating or preventing 
an elevated plasma concentration of low-density 
25 lipoprotein cholesterol, comprising orally 

administering to a human or animal subject an effective 
amount of a contemplated composition. 

A related aspect of the invention is a method of 
making a food additive composition, comprising 
30 obtaining oil containing a sterol pathway product 

compound from seed of a contemplated transgenic plant 
and mixing the oil with an edible solubilizing agent, 
an effective amount of a dispersant, and optionally, an 
effective amount of an antioxidant . 
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Novel forms of two sterol pathway enzymes and the 
nucleic acids that encode them are disclosed: an 
Arabidopsis enzyme having nucleic acid similarity to a 
squalene epoxidase, and an Arabidopsis enzyme having 
5 nucleic acid similarity to an obtusifoliol C14a- 

demethylase enzyme. Thus, the invention contemplates 
an isolated DNA molecule having a nucleotide sequence 
of disclosure SEQ ID NO: 4, 6, 8, 10, 14, 15, 17 or the 
complements thereof. Also contemplated is a nucleotide 

10 sequence that hybridizes to the nucleotide sequence of 
SEQ ID NO:4, 6, 8, 10, 14, 15, 17 or their complements 
under a wash stringency equivalent to 0 . 5X SSC to 2X 
SSC, 0.1% SDS, at 55-65°C, and that encode a 
polypeptide having squalene epoxidase or obtusifoliol 

15 C14a-demthylase enzymatic activity. Preferably, that 
enzymatic activity is substantially similar to that of 
a disclosed squalene epoxidase or obtusifoliol C14<x- 
demethylase, respectively. By substantially smiliar is 
meant having enzymatic activity differing from that 

20 of the disclosed enzymes by about 3 0% or less, 

preferably by about 2 0% or less, and more preferably 
by about 10% or less when assayed by standard 
enzymatic assays. Also contemplated is a nucleotide 
sequence encoding the same genetic information as said 

25 nucleotide sequence of SEQ ID NO : 4, 6, 8, 10, 14, 15, 
17 or their complements or that hybridize as described 
above, but which is degenerate in accordance with the 
degeneracy of the genetic code. Recombinant 
constructs, vectors and transformed host cells 

30 comprising the novel isolated and purified nucleic acid 
sequences are also contemplated. In one embodiment, 
the vector is a plant vector and the host cell is a 
plant cell. Methods of producing the disclosed 
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squalene epoxidase or obtusifoliol C14a-demethylase 
enzymes are also contemplated comprising culturing a 
transformed host cell for a time and under conditions 
conductive to the production of the squalene epoxidase 
or obtusifoliol C14a-demethylase enzyme, and 
recovering the produced squalene epoxidase or 
obtusif ol iol C14cc-demethylase enzyme. 

Yet another aspect provides any of the above 
described transformed host cells, further comprising a 
recombinant construct or expression vector encoding a 
tocopheral synthesis pathway enzyme, and in particular, 
S-adenosylmethionine- dependent a- tocopherol 
methyltransf erase . Also included are plants, seeds and 
storage organs comprising the transformed host cells. 

Another aspect provides, a process of increasing 
the formation of steroid pathway products and 
tocopherols in a transformed host cell as compared to 
an otherwise identical non- transformed host cell 
comprising (1) transforming a host cell with a 
recombinant vector comprising (a) as operably linked 
components in the 5' to 3 1 direction, a promoter, a DNA 
sequence encoding a first polypeptide having 3 -hydroxy - 
3 -methylglutaryl -Coenzyme A reductase enzyme activity, 
and a transcription termination signal sequence; and 
(b) as operably linked components in the 5 ' to 3 ' 
direction, a promoter, a DNA sequence encoding at least 
one polypeptide having steroid pathway enzyme activity 
selected from the group consisting of squalene 
epoxidase enzyme activity, sterol methyl transferase I 
enzyme activity, sterol C4 -demethylase enzyme activity, 
obtusifoliol C14a-demethylase enzyme activity, sterol 
C5-desaturase enzyme activity, and sterol methyl 
transferase II enzyme activity, and a transcription 
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termination signal sequence; (2) transforming the host 
cell of (1) with a recombinant vector comprising as 
operably linked components, a promoter, a DNA sequence 
encoding a tocopherol synthesis pathway enzyme, and a 
5 transcription termination sequence; and (3) 

regenerating said transformed plant cell into said 
transgenic plant . 



10 BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features, aspects, and advantages 
of the present invention will become better understood 
with regard to the following description, appended 
claims and accompanying figures where: 

15 Fig. 1 is an abbreviated version of a plant 

steroid compound biosynthetic pathway that shows the 
enzymes affecting steroid compound biosynthesis and 
accumulation. These include: HMG-CoA reductase, 
squalene epoxidase, sterol methyl transferase I, sterol 

20 C4 -demethylase , obtusifoliol C14a-demethylase , sterol 
C5 desaturase and sterol methyl transferase II. 

Fig. 2 depicts the forms of Arabidopsis and rubber 
HMGR1 tested in Arabidopsis and yeast to compare 
expression, activity and sterol production. 

25 Fig. 3 is a map showing the structure of construct 

pMON29920. pMON29920 is a binary transformation vector 
with P-7S/E9 3' cassette and the KAN gene flanked by 
the two borders whe re P-7S is the promoter of alpha' 
beta conglycinin protein from soybean, E9 3' is the 3' 

30 end of pea rbc E9 gene and KAN is the coding sequence 
for NPTII that confers resistance to kanamycin. The 
NPTII gene is driven by the 35S promoter from 
cauliflower mosaic virus. Spc.Str is the coding region 
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for Tn7 adenylyltransf erase conferring resistance to 
spectinomycin and streptomycin; ori-V: the vegetative 
origin of replication; rop : coding region for repressor 
of primer; ori-322 : minimum known sequence required for 
5 a functional origin of replication; NOS 3': the 3' 
termination end of nopal ine synthase coding region. 

Fig. 4 is a map showing the structure of construct 
pMON43800. pMON438 0 0 is a recombinant binary vector 
for Agrobacterium-mediated transformation, carrying the 

10 rubber HMGR1 gene cassette. The HMGR1 gene is driven 
by the 7S alpha' beta conglycinin promoter from 
soybean. P-7S: 7S promoter, rubber HMGR1 gene: coding 
sequence for 3 -hydroxy- 3 -methylglutaryl reductase from 
Hevea brasiliensis; E9 3': 3' end of pea rbcS E9 gene; 

15 P-35S: 35S promoter from cauliflower mosaic virus; KAN: 
coding region for NPTII gene conferring resistance for 
kanamycin; NOS 3': 3' termination end of nopaline 
synthase coding region: Left Border: Octapine left 
border from Octapine Ti plasmid pTiA6 ; ori-V: the 

20 vegetative origin of replication; rop: coding region 

for repressor of primer; Spc/Str: coding region for Tn7 
adenylyltransf erase conferring resistance to 
spectinomycinand streptomycin. 

Fig. 5 is a map showing the structure of construct 

25 pMON23616. pMON23616 is a plant expression vector 

containing P-NOS/ORF-7/KAN/NOS-3 ' cassette. P-NOS: NOS 
promoter from Agrobacterium tumefaciens pTiT37; ORF-7: 
a short open reading frame that attenuates expression 
of KAN in plants; KAN: coding sequence of NPTII gene 

30 that confers resistance to kanamycin and neomycin; ble: 
confers resistance to bleomycin; NOS 3': 3' termination 
end of nopaline synthase coding region; Left Border: 
Octapine left border from Octapine Ti plasmid pTiA6 ; 
ori-V: the vegetative origin of replication; rop: 
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coding region for repressor of primer; Spc/Str: coding 
region for Tn7 adenylyltransf erase conferring 
resistance to spectinomycin and streptomycin. 

Fig. 6 is a map showing the structure of construct 
pMON43818. pMON43818 is a recombinant binary vector 
carrying the gene encoding rubber hydroxymethyl 
glutaryl CoA reductasel (HMGR1) in sense orientation 
driven by the soybean alpha' beta conglycinin promoter. 
P-NOS: nopal ine synthase gene promoter; kan: coding 
region for neomycin phospho transferase protein to 
confer resistance to kanamycin; NOS 3': 3' termination 
end of nopal ine synthase coding region; Soy Alpha' Beta 
Conglycinin: 7S alpha' beta conglycinin gene promoter 
from soybean; Rubber HMGR1 gene: coding sequence for 
HMGR1 gene from Hevea brasiliensis ; E9 3' : 3' end of 
pea rbcS E9 gene; Left border: octopine left border, 
sequence essential for transfer of T-DNA into 
Agrobacterium; ori-V: plasmid origin of replication in 
Agrobacterium; rop : coding sequence for repressor of 
primer; Ori-322: origin of replication in E.coli; 
Spc/Str: coding region for Tn7 adenylyltransf erase 
(AAD (3 " ) ) conferring resistance to spectinomycin and 
streptomycin; Right Border: right border sequence of T- 
DNA essential for integration into Agrobacterium. 

Figure 7 is a map showing the structure of 
construct pMON43052. pMON43052 is a recombinant 
shuttle vector, carrying the cDNA fragment encoding the 
catalytic domain of Arabidopsis HMGR1 in sense 
orientation driven by the soybean alpha' beta 
conglycinin promoter. P-NOS: nopaline synthase gene 
promoter; kan: coding region for neomycin 
phosphotransferase protein to confer resistance to 
kanamycin; NOS 3': 3' termination end of nopaline 
synthase coding region; Soy Alpha' Beta Conglycinin: 7S 
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alpha' beta conglycinin gene promoter from soybean; 
Arabidopsis HMGR catalytic domain: coding sequence for 
the catalytic domain of Arabidopsis HMGR1 protein; E9 
3' : 3' end of pea rbcS E9 gene; Left border: octopine 
left border, sequence essential for transfer of T-DNA 
into Agrobacterium; ori-V: plasmid origin of 
replication in Agrobacterium; rop: coding sequence for 
repressor of primer; Ori-322: origin of replication in 
E.coli; Spc/Str: coding region for Tn7 

adenylyltransf erase (AAD (3 " ) ) conferring resistance to 
spectinomycin and streptomycin; Right Border: right 
border sequence of T-DNA essential for integration into 
Agroba cterium. 

Figure 8 is a map showing the structure of 
construct pMON51850. pMON51850 is a binary vector for 
Agrobacterium mediated transformation of soybean. P- 
NOS : nopaline synthase gene promoter; kan : coding 
region for neomycin phosphotransferase protein to 
confer resistance to kanamycin; NOS 3': 3' termination 
end of nopaline synthase coding region; Left border: 
octopine left border sequence essential for transfer of 
T-DNA into Agrobacterium; ori-V: plasmid origin of 
replication in Agrobacterium; rop: coding sequence for 
repressor of primer; ori-322: origin of replication in 
E.coli; Spc/Str: coding region for Tn7 

adenylyltransf erase (AAD (3 " ) ) conferring resistance to 
spectinomycin and streptomycin; Right Border: right 
border sequence of T-DNA essential for integration into 
Agroba cteri urn . 

Figure 9 is a map showing the structure of 
construct pMON43 057. pMON43 057 is a recombinant binary 
vector for Agrobacterium mediated transformation of 
soybean, carrying the gene cassette for expressing 
catalytic domain of HMGR1 from Arabidopsis thaliana . 
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The catalytic domain of the HMGR1 cDNA is driven by 
soybean 7S alpha' beta conglycinin promoter. P-NOS : 
nopal ine synthase gene promoter; kan: coding region for 
neomycin phosphotransferase protein to confer 
5 resistance to kanamycin; NOS 3': 3' termination end of 
nopal ine synthase coding region; Left border: octopine 
left border sequence essential for transfer of T-DNA 
into Agrobacterium; ori-V: plasmid origin of 
replication in Agrobacterium; rop : coding sequence for 
10 repressor of primer; ori-322: origin of replication in 
E.coli; Spc/Str: coding region for Tn7 

adenylyltransf erase (AAD (3 " ) ) conferring resistance to 
spectinomycin and streptomycin; Right Border: right 
border sequence essential for transfer of T-DNA into 
15 Agrobacterium; Soy Alpha' Beta Conglycinin: soybean 7S 
alpha' beta conglycinin gene promoter; Arabidopsis 
HMGR catalytic domain: coding sequence for Arabidopsis 
HMGR1 catalytic domain; E9 3': 3' end of pea rbcS E9 
gene . 

20 Figure 10 is a map showing the structure of 

construct pMON43058. pMON43058 is a recombinant binary 
vector for Agrrorjacterium-mediated soybean 
transformation, carrying gene expression cassettes for 
catalytic domain of HMGR1 from Arabidopsis thaliana and 

25 SMTII from Arabidopsis thaliana. P-NOS: nopaline 

synthase gene promoter; kan: coding region for neomycin 
phosphotransferase protein to confer resistance to 
kanamycin; NOS 3': 3' termination end of nopaline 
synthase coding region; Left border: octopine left 

30 border sequence essential for transfer of T-DNA into 

Agrobacterium; ori-V: plasmid origin of replication in 
Agrobacterium; rop: coding sequence for repressor of 
primer; ori-322: origin of replication in E.coli; 
Spc/Str: coding region for Tn7 adenylyltransf erase 
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(AAD (3 " ) ) conferring resistance to spectinomycin and 
streptomycin; Right Border: right border sequence 
essential for transfer of T-DNA into Agrobacterium; Soy 
Alpha' Beta Conglycinin: 7S alpha' beta conglycinin 
gene promoter from soybean; Arabidopsis HMGR catalytic 
domain: sequence encoding the catalytic domain of 
Arabidopsis HMGR1; E9 3': 3' end of pea rbcS E9 gene; 
Soy Alpha' Beta Conglycinin: soybean 7S alpha' beta 
conglycinin gene promoter; Arabidopsis SMT2 : cDNA 
encoding sterol methyl transferase II enzyme from 
Arabidopsis thaliana (accession no: X89867) ; NOS 3': 3' 
termination end of nopaline synthase coding region. 

Fig. 11 is profile (histogram) of the sterol 
composition of Rl transgenic soybean seeds when 
Arabidopsis truncated HMGR (catalytic domain without 
linker) was overexpressed using seed-specific 7S 
promoter (data from pMON43057 : p7S : : At HMGR truncated). 

Fig. 12 is a profile (histogram) of the sterol 
composition of Rl transgenic soybean seeds when 
Arabidopsis truncated HMGR (catalytic domain without 
linker) and Arabidopsis SMTII were overexpresed (data 
from pMON43058 :p7S : :At HMGR truncated and p7S::At 
SMTII) . The expression of the genes is controlled by 
the seed-specific 7S promoter. 

Figure 13 is a map showing the structure of 
construct pMON53733. pMON53733 is a recombinant binary 
vector carrying the cDNA encoding full-length form of 
Arabidopsis hydroxymethyl glutaryl CoA reductase 1 
(HMGR1) in sense orientation driven by the enhanced 
cauliflower mosaic virus 35S promoter. P-35S: 35S 
promoter from cauliflower mosaic virus; kan: confers 
resistance to neomycin and kanamycin; NOS 3': 3' 
termination end of nopaline synthase coding region; 
Left border: octopine left border, sequence essential 
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for transfer of T-DNA into Agrobacterium; ori-V: 
plasmid origin of replication in Agrobacterium; rop: 
coding sequence for repressor of primer; ori-322: 
origin of replication in E.coli; Spc/Str: coding region 
5 for Tn7 adenylyltransf erase (AAD ( 3 " ) ) conferring 

resistance to spectinomycin and streptomycin; Right 
Border: right border sequence of T-DNA essential for 
integration into Agrobacterium} P-e35S: enhanced 
cauliflower mosaic virus promoter; Arabidopsis HMGR1 : 

10 cDNA sequence encoding full-length form of Arabidopsis 
HMGR1; E9 3': 3' end of pea rbcS E9 gene. 

Figure 14 is a map showing the structure of 
construct pMON53734. pMON53734 is a recombinant binary 
vector carrying the cDNA encoding catalytic domain with 

15 linker region of Arabidopsis hydroxymethyl glutaryl CoA 
reductasel (HMGR1) in sense orientation driven by the 
enhanced cauliflower mosaic virus 35S promoter. P-35S: 
35S promoter from cauliflower mosaic virus; kan: 
confers resistance to neomycin and kanamycin; NOS 3': 

20 3' termination end of nopaline synthase coding region; 
Left border: octopine left border, sequence essential 
for transfer of T-DNA into Agrobacterium; ori-V: 
plasmid origin of replication in Agrobacterium; rop: 
coding sequence for repressor of primer; ori-322: 

25 origin of replication in E.coli; Spc/Str: coding region 
for Tn7 adenylyltransf erase (AAD (3")) conferring 
resistance to spectinomycin and streptomycin; Right 
Border: right border sequence of T-DNA essential for 
integration into Agrobacterium; P-e35S: enhanced 

30 cauliflower mosaic virus promoter; Arabidopsis tHMGRl : 
cDNA sequence encoding catalytic domain with linker 
region of Arabidopsis HMGR1 ; E9 3': 3' end of pea rbcS 
E9 gene. 
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Figure 15 is a map showing the structure of 
construct pMON53735. pMON53735 is a recombinant binary 
vector carrying the cDNA encoding catalytic domain 
without the linker region of Arabidopsis hydroxymethyl 
glutaryl CoA reductasel (HMGR1) in sense orientation 
driven by the enhanced cauliflower mosaic virus 35S 
promoter. P-35S: 35S promoter from cauliflower mosaic 
virus; kan: confers resistance to neomycin and 
kanamycin; NOS 3': 3' termination end of nopaline 
synthase coding region; Left border: octopine left 
border, sequence essential for transfer of T-DNA into 
Agrobacterium; ori-V: plasmid origin of replication in 
Agrobacterium; rop : coding sequence for repressor of 
primer; ori-322: origin of replication in E.coli; 
Spc/Str: coding region for Tn7 adenylyltransf erase 
(AAD ( 3 " ) ) conferring resistance to spectinomycin and 
streptomycin; Right Border: right border sequence of T- 
DNA essential for integration into Agrobacterium; P- 
e35S: enhanced cauliflower mosaic virus promoter; 
Arabidopsis cHMGRl : cDNA sequence encoding catalytic 
domain without the linker region of Arabidopsis HMGR1 ; 
E9 3' : 3' end of pea rbcS E9 gene. 

Figure 16 is a map showing the structure of 
construct pMON5373 6. pMON53 73 6 is a recombinant binary 
vector carrying the cDNA encoding full-length form of 
rubber (Hevea brasiliensis) hydroxymethyl glutaryl CoA 
reductasel (HMGR1) in sense orientation driven by the 
enhanced cauliflower mosaic virus 35S promoter. P-35S: 
35S promoter from cauliflower mosaic virus; kan: 
confers resistance to neomycin and kanamycin; NOS 3': 
3' termination end of nopaline synthase coding region; 
Left border: octopine left border, sequence essential 
for transfer of T-DNA into Agrobacterium; ori-V: 
plasmid origin of replication in Agrobacterium; rop: 
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coding sequence for repressor of primer; ori-322: 
origin of replication in E.coli; Spc/Str: coding region 
for Tn7 adenylyltransf erase (AAD (3 " ) ) conferring 
resistance to spect inomycin and streptomycin; Right 
5 Border: right border sequence of T-DNA essential for 
integration into Agrobacterium; P-e35S: enhanced 
cauliflower mosaic virus promoter; Hevea HMGR1 : cDNA 
sequence encoding full-length form of rubber HMGR1 ; E9 
3' : 3' end of pea rbcS E9 gene. 

10 Figure 17 is a map showing the structures of 

construct pMON53737. pMON53737 is a recombinant binary 
vector carrying the cDNA encoding catalytic domain with 
linker region of rubber {Hevea brasiliensis) 
hydroxymethyl glutaryl CoA reductasel (HMGR1) in sense 

15 orientation_driven by the enhanced cauliflower mosaic 
virus 35S promoter. P-35S: 35S promoter from 
cauliflower mosaic virus; kan: confers resistance to 
neomycin and kanamycin; NOS 3': 3' termination end of 
nopaline synthase coding region; Left border: octopine 

20 left border, sequence essential for transfer of T-DNA 
into Agrobacterium; ori-V: plasmid origin of 
replication in Agrobacterium; rop: coding sequence for 
repressor of primer; ori-322 : origin of replication in 
E.coli; Spc/Str: coding region for Tn7 

25 adenylyltransf erase (AAD (3")) conferring resistance to 
spect inomycin and streptomycin; Right Border: right 
border sequence of T-DNA essential for integration into 
Agrobacterium; P-e35S: enhanced cauliflower mosaic 
virus promoter; rubber tHMGRl : cDNA sequence encoding 

30 catalytic domain with linker region of rubber HMGR1 ; E9 
3 ' : 3 ' end of pea rbcS E9 gene . 

Figure 18 is a map showing the structure of 
construct pMON53738. pMON53738 is a recombinant binary 
vector carrying the cDNA encoding mutant form of rubber 
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{Hevea bra.silien.sis) hydroxymethyl glutaryl CoA 
reductasel (HMGR1) in sense orientation driven by the 
enhanced cauliflower mosaic virus 35S promoter. In the 
mutant rubber HMGR1 the putative phosphorylation site, 
5 the serine amino acid residue at position 566 is 

changed to alanine amino acid residue (SEQ ID 23) . P- 
35S: 35S promoter from cauliflower mosaic virus; kan: 
confers resistance to neomycin and kanamycin; NOS 3': 
3' termination end of nopaline synthase coding region; 

10 Left border: octopine left border, sequence essential 
for transfer of T-DNA into Agrobacterium; ori-V: 
plasmid origin of replication in Agrobacterium; rop: 
coding sequence for repressor of primer; ori-322: 
origin of replication in E.coli; Spc/Str: coding region 

15 for Tn7 adenylyltransf erase (AAD (3 " ) ) conferring 

resistance to spect inomycin and streptomycin; Right 
Border: right border sequence of T-DNA essential for 
integration into Agrobacterium; P-e35S: enhanced 
cauliflower mosaic virus promoter; rubber tHMGRl Ala 

20 566: cDNA sequence encoding catalytic domain with 
linker region of rubber HMGR1 in which serine amino 
acid residue at position 566 is changed to alanine 
amino acid residue using site directed mutagenesis; E9 
3' : 3' end of pea rbcS E9 gene. 

25 Figure 19 is a map showing the structure of 

construct pMON53739. pMON53739 is a recombinant binary 
vector carrying the cDNA encoding mutant form of rubber 
{Hevea bras Hi ens is) hydroxymethyl glutaryl CoA 
reductasel (HMGR1) in sense orientation driven by the 

30 enhanced cauliflower mosaic virus 35S promoter. In the 
mutant rubber HMGR1 the putative phosphorylation site, 
the serine amino acid residue at position 567 is 
changed to alanine amino acid residue (SEQ ID 24) . P- 
35S: 35S promoter from cauliflower mosaic virus; kan: 
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confers resistance to neomycin and kanamycin; NOS 3': 
3' termination end of nopaline synthase coding region; 
Left border: octopine left border, sequence essential 
for transfer of T-DNA into Agrobacterium; ori-V: 
plasmid origin of replication in Agrobacterium; rop : 
coding sequence for repressor of primer; ori-322: 
origin of replication in E.coli; Spc/Str: coding region 
for Tn7 adenylyltransf erase (AAD (3 " ) ) conferring 
resistance to spectinomycin and streptomycin; Right 
Border: right border sequence of T-DNA essential for 
integration into Agrobacterium; P-e35S: enhanced 
cauliflower mosaic virus promoter; rubber tHMGRl Ala 
567: cDNA sequence encoding catalytic domain with 
linker region of rubber HMGR1 in which serine amino 
acid residue at position 567 is changed to alanine 
amino acid residue using site directed mutagenesis; E9 
3 ' : 3 ' end of pea rbcS E9 gene . 

Figure 2 0 is a map showing the structure of 
construct pMON53740. pMON53740 is a recombinant binary 
vector carrying the cDNA encoding catalytic domain 
without linker region of rubber (Hevea bras i liens is) 
hydroxymethyl glutaryl CoA reductasel (HMGR1) in sense 
orientation driven by the enhanced cauliflower mosaic 
virus 35S promoter. P-35S: 35S promoter from 
cauliflower mosaic virus; kan: confers resistance to 
neomycin and kanamycin; NOS 3': 3' termination end of 
nopaline synthase coding region; Left border: octopine 
left border, sequence essential for transfer of T-DNA 
into Agrobacterium; ori-V: plasmid origin of 
replication in Agrobacterium; rop: coding sequence for 
repressor of primer; ori-322: origin of replication in 
E.coli; Spc/Str: coding region for Tn7 

adenylyltransf erase (AAD ( 3 " ) ) conferring resistance to 
spectinomycin and streptomycin; Right Border: right 
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border sequence of T-DNA essential for integration into 
Agrobacterium; P-e35S: enhanced cauliflower mosaic 
virus promoter; rubber cHMGRl : cDNA sequence encoding 
catalytic domain without linker region of rubber HMGR1; 
5 E9 3' : 3' end of pea rbcS E9 gene. 

Fig. 21 is a graph comparing the cycloartenol 
content in micrograms of steroid compound per gram of 
seeds analyzed in transgenic Arabidopsis plants 
transformed with pMON53733 through pMON53 740 compared 
10 to control plants. 

Fig. 22 is a graph comparing the 24 -methylene 
cycloartenol content in micrograms of steroid compound 
per gram of seeds analyzed in transgenic Arabidopsis 
plants transformed with pMON53733 through pMON53740 
15 compared to control plants. 

Fig. 23 is a graph comparing the obtusifoliol 
content in micrograms of steroid compound per gram of 
seeds analyzed in transgenic Arabidopsis plants 
transformed with pMON53 733 through pMON53 74 0 compared 
20 to control plants. 

Fig. 24 is a graph comparing the campesterol 
content in micrograms of steroid compound per gram of 
seeds analyzed in transgenic Arabidopsis plants 
transformed with pMON53733 through pMON53740 compared 
25 to control plants. 

Fig. 25 is a graph comparing the sitosterol 
content in micrograms of steroid compound per gram of 
seeds analyzed in transgenic Arabidopsis plants 
transformed with pMON53 733 through pMON53 74 0 compared 
30 to control plants. 

Fig. 2 6 is a graph comparing the sitostanol 
content in micrograms of steroid compound per gram of 
seeds analyzed in transgenic Arabidopsis plants 
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transformed with pMON53 73 3 through pMON53 74 0 compared 
to control plants. 

Fig. 27 is a sterol profile (histogram) of 
transgenic Arabidopsis harboring different forms of 
5 rubber HMGR. 

Fig. 2 8 is a graph of the squalene, zymosterol and 
erogosterol content in micrograms of sterol per 
milligram of cell dry weight from HMGR constructs in 
yeast HMGR1 knockout mutants for constructs having full 

10 length and truncated HMG CoA reductase (HMGR) 

sequences. The truncated sequences contain substantial 
portions of the catalytic region but lack the linker 
region and the transmembrane region of HMGR. These 
sequences are derived from Arabidopsis and rubber 

15 plants. 

Fig. 2 9 is a map showing the structure of 
construct pMON43842. pMON43842 is a yeast expression 
vector carrying cDNA encoding Arabidopsis putative 
obtusifoliol C14oc-demethylase (AC002329) in sense 

20 orientation driven by the p423Gall promoter. Sc.His3: 
HIS3 region from Saccharomyces cerevisiae encoding 
imidazoleglycerol -phosphate dehydratase for histidine 
synthesis; Ori-fl: bacteriophage fl origin of 
replication; LAC: contains partial lad coding 

25 sequence, promoter Plac, promoter Pt7, promoter Pt3, KS 
polylinker, and partial lacZ coding sequence; lacZ: 
partial coding sequence for beta-d-galactosidase or 
lacZ protein; T-Sc.Cycl: a terminator from Cycl- iso- 
1- cytochrome c from Saccharomyces cerevisiae to 

30 terminate transcription; obtus . C14a . demethylase 
(AC002329) : cDNA encoding Arabidopsis putative 
obtusifoliol C14a-demethylase; P-Sc.Gall: a promoter 
from Gall- galactokinase of Saccharomyces cerevisiae to 
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direct expression with galactose induction; LacZ- 
alpha: partial coding sequence for beta-d-galactosidase 
or lacZ protein; Ori-pUC: minimum sequence required 
for a functional origin of replication, sequence 
5 downstream of this region is known to affect copy 

number when expressed in bacteria; AMP: contains the 
P3 promoter and the beta-lactamase coding sequence, 
conferring resistance to ampicillin, penicillin, and 
carbenicillin; Sc.2micron: 2 micron origin of 

10 replication. 

Fig. 3 0 is a map showing the structure of 
construct pMON43843. pMON43843 is a yeast expression 
vector carrying cDNA encoding Arabidopsis putative 
squalene epoxidase 1 (ATA506263) in sense orientation 

15 driven by the p423Gall promoter. Sc.His3: HIS3 region 
from Sac char omyces cerevisiae encoding 
imidazoleglycerol -phosphate dehydratase for histidine 
synthesis; Ori-fl: bacteriophage fl origin of 
replication; LAC: contains partial lad coding 

20 sequence, promoter Plac, promoter Pt7, promoter Pt3, KS 
polylinker, and partial lacZ coding sequence; lacZ: 
partial coding sequence for beta-d-galactosidase or 
lacZ protein; T-Sc.Cycl: a terminator from Cycl- iso- 
1- cytochrome c from Saccharomyces cerevisiae to 

25 terminates transcription; Squalene epoxidase 1 
(ATA5062 63) : cDNA encoding Arabidopsis putative 
squalene epoxidase 1 (ATA506263); P-Sc.Gall: a 
promoter from Gall- galactokinase of Saccharomyces 
cerevisiae to direct expression with galactose 

30 induction; LacZ-alpha: partial coding sequence for 

beta-d-galactosidase or lacZ protein; Ori-pUC: minimum 
sequence required for a functional origin of 
replication, sequence downstream of this region is 
known to affect copy number when expressed in bacteria; 
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AMP: contains the P3 promoter and the beta-lactamase 
coding sequence, conferring resistance to ampicillin, 
penicillin, and carbenicillin; Sc.2micron: 2 micron 
origin of replication. 
5 Fig. 31 is a map showing the structure of 

construct pMON43844. pM0N43844 is a yeast expression 
vector carrying cDNA encoding Arabidopsis putative 
squalene epoxidase 1(ATA304243) in sense orientation 
driven by the p42 3Gall promoter. Sc.His3: HIS3 region 

10 from Saccharomyces cerevisiae encoding 

imidazoleglycerol -phosphate dehydratase for histidine 
synthesis; Ori-fl: bacteriophage fl origin of 
replication; LAC: contains partial lacl coding 
sequence, promoter Plac, promoter Pt7 ; promoter Pt3 , KS 

15 polylinker, and partial lacZ coding sequence; lacZ : 
partial coding sequence for beta-d-galactosidase or 
lacZ protein; T-Sc.Cycl: a terminator from Cycl- iso- 
1- cytochrome c from Saccharomyces cerevisiae to 
terminate transcription; Arab, squalene epoxidase 1 

20 (ATA304243) : cDNA encoding Arabidopsis putative 
squalene epoxidase 1 (ATA304243); P-Sc.Gall: a 
promoter from Gall- galactokinase of Saccharomyces 
cerevisiae to direct expression with galactose 
induction; LacZ-alpha: partial coding sequence for 

25 beta-d-galactosidase or lacZ protein; Ori-pUC: minimum 
sequence required for a functional origin of 
replication, sequence downstream of this region is 
known to affect copy number when expressed in bacteria; 
AMP: contains the P3 promoter and the beta-lactamase 

30 coding sequence, conferring resistance to ampicillin, 
penicillin, and carbenicillin; Sc.2micron: 2 micron 
origin of replication. 

Fig. 32 is a comparision of known HMG CoA 
reductase amino acid sequences. ClustalW alignment of 
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forty- three non- redundant HMG-CoA reductase sequences 
to represent archaebacterial , eubacterial, fungal, 
plant and animal groups. The putative functional 
domains in the alignment marked as described below are 
5 based on the three dimnensional structure of 

Pseudomonas mevalonii HMGR (Lawrence et al. , 1995, 
Science 268:1758): boxe d - HMGCoA binding domain, light 
shade-NAD(H) binding domain, underlined consensus- 
domains involved in catalysis, * underneath consensus 

10 and boldface-key histidine residue involved in 

catalysis. The putative phosphorylation site residues 
are marked with t and boldface, and are located at the 
C- terminal region of the protein, adjacent to a highly 
conserved arginine, marked with t and boldface. Also 

15 indicated are the conserved Glu (E) , Lys (K) , and Asp 
(D) residues, marked by E , K, and D, respectively. 
These residues are thought to be critical in catalysis, 
based on the crystal structure (Tabernero et al . , 1999; 
PNAS 96 (13) :7167-71) . 

20 Appendices A through C show SEQ ID Nos : 1 through 

3, respectively. Appendices D through G show SEQ ID 
Nos 2 0 thorough 23, respectively. 

DETAILED DESCRIPTION 

25 The following detailed description is provided to 

aid those skilled in the art in practicing the present 
invention. Even so, this detailed description should 
not be construed to unduly limit the present invention 
as modifications and variations in the embodiments 

30 discussed herein can be made by those of ordinary skill 
in the art without departing from the spirit or scope 
of the present inventive discovery. 

All publications, patents, patent applications, 
databases and other references cited in this 
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application are herein incorporated by reference in 
their entirety as if each individual publication, 
patent, patent application, database or other reference 
were specifically and individually indicated to be 
5 incorporated by reference. 

We have expressed the full-length forms of the 
rubber and Arabidopsis HMGRs driven by seed-specific 
promoters in transgenic canola and soybean. We have 
demonstrated sterol over-production up to 2-4 fold 

10 higher in seeds from these transgenic plants. We also 
demonstrated a higher accumulation of pathway 
intermediates in soybean than canola. These results 
were disclosed in PCT publication WO 00/61771. 
However, we have expressed a truncated form of the 

15 Arabidopsis hmgl without the linker and membrane 
spanning domains in Arabidopsis and soybean. The 
results in Arabidopsis were similar to that 
demonstrated by Gonzalez et al . (1997) and we compared 
the sterol profiles of our transgenic plants with those 

20 produced by Gonzalez et al . , using our methods to show 
they are comparable . We found the same types of pathway 
intermediates accumulating. However, in soybean seeds 
we have demonstrated the accumulation of squalene to a 
very high level (~3mg/g seed which is around 100-fold 

25 higher than in nontransgenic controls) . This is an 
unexpected result not disclosed or suggested in the 
prior art. Squalene is a precursor for sterols and in 
soybean it appears that there is a "bottleneck" in the 
further conversion of this precursor to sterols. Thus, 

30 it appears that there could be additional ways of over- 
producing sterols in soybean to levels greater than 10- 
fold which would include combining a truncated form of 
HMGR with other genes coding for enzymes down- stream of 
squalene . 
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This opens the potential to combine other genes 
such as squalene epoxidase for further enhancing the 
levels of desirable sterols. Such a combination has 
not been disclosed or suggested in the prior art. 
Squalene expoxidase catalyzes the addition of oxygen to 
squalene which is a 30-carbon linear isoprenoid chain 
thus allowing for cyclization to form cycloartenol . 
Additional enzymes downst ream that can be also be 
manipulated are sterol methyltransf erase 1, C-4 
demethylase, C-14 demethylase, sterol methytransf erase 
2, and C-5 desaturase that would all deplete other 
pathway intermediates shown to accumulate in soybeans . 
By using such strategies it is possible to convert all 
of the squalene and other intermediates to end sterols 
such as sitosterol, stigmasterol and campesterol . Thus, 
sterol level in soybean oils can be elevated from 0.3% 
up to 3.5%. Expression of the full-length rubber HMGR 
in soybeans results in a sterol level increase up to 
2.7%. 

Enhancement of 3 -hydroxy-3 -methylglutaryl -CoA 
reductase (HMG Co-A reductase) activity in certain 
cells results in increased sterol biosynthesis. See, 
e.g. Chappell, U.S. Patent No. 5,589,619. The present 
discovery further contemplates an increase of steroid 
pathway end products such as A5 sterols and their 
stanol counterparts with a decreased accumulation of 
certain steroid pathway intermediates by also enhancing 
various specific steroid pathway enzyme activities, 
such that more of the steroid pathway intermediate 
compounds are converted to steroid pathway end 
products . 

DNA sequences encoding squalene epoxidases are 
useful for removal of squalene accumulation, genes 
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encoding sterol methyl transferase I enzymes are useful 
for removal of cylcoartenol accumulation, genes 
encoding sterol C4 -demethylase are useful for removal 
of 24 -methylene cycloartenol accumulation, genes 
encoding obtusifoliol C14a-demethylases are useful for 
removal of accumulation of obtusifoliol, genes encoding 
sterol C5-desaturases are useful for removal of 
stigmasta-7-enol accumulation, and genes encoding 
sterol methyl transferase II enzymes are useful for the 
reduction of accumulated campesterol and concomitant 
increase of sitosterol . 

Levels of sitostanol and sitostanol esters can be 
elevated further by approximately 2- to 40 -fold over 
the transgenic plants of the art having only added 
genes for HMG CoA reductase by introducing additional 
genes encoding one or more of the following sterol 
pathway enzymes: a squalene epoxidase, a sterol methyl 
transferase I, a sterol C4 -demethylase , an obtusifoliol 
C14a-demethylase, a sterol C5-desaturase, a sterol 
methyl transferase II. 

As used herein, the term "structural coding 
sequence" means a DNA sequence which encodes for a 
peptide, polypeptide, or protein which may be made by a 
cell following transcription of the DNA to mRNA, 
followed by translation to the desired peptide, 
polypeptide, or protein. 

The term "sterol" as applied to plants refers to 
any chiral tetracyclic isopentenoid which may be formed 
by cyclization of squalene oxide through the transition 
state possessing stereochemistry similar to the trans- 
syn- trans -anti -trans -anti configuration, i.e., 
protosteroid cation, and which retains a polar group at 
C-3 (hydroxyl or keto) , an all -trans -anti 
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stereochemistry in the ring system, and a side-chain 
20R-conf iguration (Parker et al . (1992) In Nes et al . , 
Eds., Regulation of Isopentenoid Metabolism, ACS 
Symposium Series No. 497, p. 110; American Chemical 
Society, Washington, D.C.). The numbering of the 
carbon atoms of a representative sterol (cholesterol) 
is shown in the following structure (FORMULA II) : 
As used herein, the term "sterol" refers to 
unsaturated hydroxyl group- containing derivatives of a 
fused, reduced ring system, cyclopenta [a] -phenanthrene, 
comprising three fused cyclohexane rings (A, B and C) 
in a phenanthrene arrangement , and a terminal 
cyclopentane ring (D) . The exemplary steroid below 
(FORMULA II) illustrates the numbering system employed 
herein in describing the location of groups and 
substituents . 




Sterols may or may not contain a C-5 to C-6 double 
bond, as this is a feature introduced late in the 
biosynthetic pathway (note Scheme 1, below) . Sterols 
contain a C 8 -C 10 side chain at the C-17 position, as 
shown above . 

The term "phytosterol , " which applies to sterols 
found uniquely in plants, refers to a sterol containing 
a C-5, and in some cases a C-22, double bond. 
Phytosterols are further characterized by alkylation of 
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the C-17 side-chain with a methyl or ethyl substituent 
at the C-24 position. Major phytosterols include, but 
are not limited to, sitosterol, stigmasterol , 
campesterol , brassicasterol , etc. Cholesterol, which 
5 lacks a C-24 methyl or ethyl side chain, is found in 
plants but is not unique thereto, and is not a 
"phytosterol " 

" Phytostanols " are saturated forms of phytosterols 
wherein the C-5 and, when present, C-22 double bond(s) 
10 is (are) reduced, and include, but are not limited to, 
sitostanol, campestanol, and 22 -dihydrobrassicastanol . 

"Phytosterol esters" and "phytostanol esters" are 
further characterized by the presence of a fatty acid 
or phenolic acid moiety rather than a hydroxyl group at 
15 the C-3 position. 

The term "steroid compounds" includes sterols, 
phytosterols, phytosterol esters, phytostanols, and 
phytostanol esters. 

The term "phytosterol compound" refers to at least 
20 one phytosterol, at least one phytosterol ester, or a 
mixture thereof . 

The term "phytostanol compound" refers to at least 
one phytostanol, at least one phytostanol ester, or a 
mixture thereof . 
25 The term "constitutive promoter" refers to a 

promoter that operates continuously in a cell, and 
which is not subject to quantitative regulation. The 
gene with which such a promoter is associated is always 
" turned on . " 

30 The terms " seed- specif ic , " " fruit - specif ic , " 

"plastid-specific, " etc., as they apply to promoters 
refer to preferential or exclusive activity of these 
promoters in these organs or organelles, respectively. 
"Preferential expression" refers to promoter activity 
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greater in the indicated organs or organelles than 
elsewhere in the plant. "Seed-specific" comprehends 
expression in the aleurone layer, endosperm, and/or 
embryo of the seed. 
5 As used herein "isolated polynucleotide" means a 

polynucleotide that is free of one or both of the 
nucleotide sequences which flank the polynucleotide in 
the naturally- occurring genome of the organism from 
which the polynucleotide is derived. The term 

10 includes, for example, a polynucleotide or fragment 
thereof that is incorporated into a vector or 
expression cassette; into an autonomously replicating 
plasmid or virus; into the genomic DNA of a prokaryote 
or eukaryote; or that exists as a separate molecule 

15 independent of other polynucleotides. It also includes 
a recombinant polynucleotide that is part of a hybrid 
polynucleotide, for example, one encoding a polypeptide 
sequence . 

As used herein "polynucleotide" and 
20 "oligonucleotide" are used interchangeably and refer 

to a polymeric (2 or more monomers) form of nucleotides 
of any length, either ribonucleotides or 

deoxyribonucleotides . Although nucleotides are usually 
joined by phosphodiester linkages, the term also 

25 includes polymeric nucleotides containing neutral amide 
backbone linkages composed of aminoethyl glycine units. 
This term refers only to the primary structure of the 
molecule. Thus, this term includes double- and single- 
stranded DNA and RNA. It also includes known types of 

30 modifications, for example, labels, methylation, 

"caps", substitution of one or more of the naturally 
occurring nucleotides with an analog, internucleot ide 
modifications such as, for example, those with 
uncharged linkages (e.g., methyl phosphonates , 



38 MTC 6783.1 

PATEWT 

phosphotriesters, phosphoamidates , carbamates, etc.), 
those containing pendant moieties, such as, for 
example, proteins (including for e.g., nucleases, 
toxins, antibodies, signal peptides, poly-L-lysine , 
etc.), those with intercalators (e.g., acridine, 
psoralen, etc.), those containing chelators (e.g., 
metals, radioactive metals, boron, oxidative metals, 
etc.), those containing alkylators, those with modified 
linkages (e.g., alpha anomeric nucleic acids, etc.), as 
well as unmodified forms of the polynucleotide. 
Polynucleotides include both sense and antisense 
strands . 

The alternative nucleotide sequences described 
above are considered to possess substantially similar 
enzymatic activity to that of the polypeptide-encoding 
polynucleotide sequences of the present invention if 
they encode polypeptides having enzymatic activity 
differing from that of any of the polypeptides encoded 
by the polynucleotide sequences of the present 
invention by about 3 0% or less, preferably by about 2 0% 
or less, and more preferably by about 10% or less when 
assayed by standard enzymatic assays. 

As used herein "effective amount" is intended to 
qualify the amount of an agent which will achieve the 
goal of a lessening in the severity and/or the 
frequency of incidence of a disease condition or 
disorder, over no treatment. 

The phrase "steroid pathway products" refers to 
the products of steroid biosynthesis produced by the 
action of one or more of squalene epoxidase enzyme, 
sterol methyl transferase I enzyme, sterol C4- 
demethylase enzyme, obtusifoliol C14a-demethylase 
enzyme, sterol C5-desaturase enzyme, and sterol methyl 
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transferase II enzyme. Specific examples of steroid 
pathway products include, but are not limited to, 
sitosterol, sitostanol, stigmasterol and stigmastanol . 
In the context of the present disclosure, a "non- 
5 transformed" plant or cell refers to a plant or cells 
which does not comprise introduced polynucleotides 
encoding a polypeptide having 3 -hydroxy- 3- 
methyulglutaryl -Coenzyme A reductase enzyme activity 
and at least one polypeptide having squalene epoxidase 

10 enzyme activity, sterol methyl transferase I enzyme 
activity, sterol C4 -demethylase enzyme activity, 
obtusifoliol C14a-demethylase enzyme activity, sterol 
C5-desaturase enzyme activity, or sterol methyl 
transferase II enzyme activity. Thus, a plant or cell 

15 that contains introduced polynucleotide sequences other 
than those above, would still be considered "non- 
transformed . " 

As used herein, "peptide" and "protein" are used 
interchangeably and mean a compound that consists of 

20 two or more amino acids that are linked by means of 
peptide bonds. 



I . Plant Steroid Biosynthesis 

To aid the reader in understanding the present 
25 invention, descriptions of the sterol compound 
biosynthetic pathway are presented below. These 
descriptions identify enzymes useful in achieving the 
modifications to the biosynthesis and accumulation of 
sterol compounds described herein, and identify sources 
30 of nucleic acid sequences encoding these enzymes. 

Various steps in the steroid compound biosynthetic 
pathway in plants are shown in Scheme 1, below. The 
numbers over the arrows refer to plant sterol compound 
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biosynthetic pathway enzymes and genes as indicated in 
Table 1. 
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Table 1 

Plant Sterol Compound Pathway Enzymes and Genes 



Enzyme Step in GenBank 

Pathway Gene ID 



Acetoacetyl -CoA thiolase 
HMG-CoA synthase 
HMG-CoA reductase 

Mevalonate kinase 
Phosphomevalonate kinase 

Mevalonate pyrophosphate 
decarboxylase 

Isopentenyl diphosphate 
isomerase 

Farnesyl pyrophosphate 
synthase 

Squalene synthase 

Squalene epoxidase 

Squalene cyclase 
Sterol C-24 
methyl transferase 

Sterol C-4 demethylase 

Cycloeucalenol - 
obtusifoliol isomerase 

Sterol C-14 demethylase 



1 
2 
3 

4 
5 



10 

11 
12 , 18 



X78116 
X83882 
X15032 
L19262 
X77793 
Not 

available 
Y14325 

U4 92 5 9 
U47324 
X75789 

AF0045 
60 

AB0168 
83 

U87266 
U71400 



13, 19 Not 
available 

14 Not 
available 

15 U74319 
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10 



Sterol C-14 reductase 

Sterol C-8 isomerase 

Sterol C-5 desaturase 
Sterol C-7 reductase 
Sterol C-24 isomerase 



17 

20 
21 
22 



Sterol C-24 reductase 



Sterol C-22 desaturase 



Sterol C-5 reductase 



23 



24 



25 
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PCT WO 
97/48793 
AF03 03 
57 

X90454 
U49398 
Klahre 
et al . 
(1998) 
Plant Cell 
10: 1677- 
1690 

Same 
as 22 
Not 

available 
wo 

00/61771 



The plant sterol compound biosynthesis pathway has 
two distinct components. The early pathway reactions, 
leading from acetyl -CoA to squalene via mevalonic acid, 
are common to other isoprenoids. The later pathway 
reactions, leading from squalene to the major plant 
sterol compounds such as sitosterol, campesterol and 
stigmasterol , are committed biosynthetic reactions. 

The early pathway reactions have been studied in 
fungi and plants (Lees et al . , Biochemistry and 
Function of Sterols, Nes and Parish, Eds., CRC Press, 
85-99 (1997) ; Newman and Chappell, Biochemistry and 
Function of Sterols, Nes and Parish, Eds., CRC Press, 
123-134 (1997); Bach et al . , Biochemistry and Function 
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of Sterols, Nes and Parish, Eds., CRC Press, 135-150 
(1997) ) . 

Acetoacetyl CoA thiolase (EC 2.3.1.9) catalyzes 
the first reported reaction, which consists of the 
formation of acetoacetyl CoA from two molecules of 
acetyl CoA (Dixon et al . , J. Steroid Biochem. Mol . 
Biol. 62: 165-171 (1997)). This enzyme has been 
purified from radish. A radish cDNA has been isolated 
by functional complementation in Saccharomyces 
cerevisiae (GeneBank Accession # X78116) . A radish 
cDNA has also been screened against a cDNA library of 
Arabidopsis thaliana (Vollack and Bach, Plant 
Physiology 111: 1097-1107 (1996)). 

HMGCoA synthase (EC 4.1.3.5) catalyzes the 
production of HMGCoA. This reaction condenses acetyl 
CoA with acetoacetyl CoA to yield HMGCoA. HMGCoA 
synthase has been purified from yeast. A plant HMGCoA 
synthase cDNA has also been isolated from Arabidopsis 
thaliana (Montamat et al . , Gene 167: 197-201 (1995)). 

HMGCoA reductase, also referred to as 3 -hydroxy- 3- 
methyglutaryl- coenzyme A (EC 1.1.1.34), catalyzes the 
reductive conversion of HMGCoA to mevalonic acid (MVA) . 
This reaction is reported to play a role in controlling 
plant isoprenoid biosynthesis (Gray, Adv. Bot. Res. 14: 
25-91 (1987); Bach et al . , Lipids 26: 637-648 (1991); 
Stermer et al . , J". Lipid Res. 35: 1133-1140 (1994). 
Plant HMGCoA reductase genes are often encoded by 
multigene families. The number of genes comprising 
each multigene family varies, depending on the species, 
ranging from two in Arabidopsis thaliana to at least 
seven in potato. Overexpression of plant HMGCoA 
reductase genes in transgenic tobacco plants has been 
reported to result in the overproduction of 
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phytosterols (Schaller et al . , Plant Physiol. 109: 761- 
770 (1995) ) . 

Mevalonate kinase (EC 2.7.1.36) catalyzes the 
phosphorylation of mevalonate to produce mevalonate 5- 
phosphate. It has been reported that mevalonate kinase 
plays a role in the control of isoprenoid biosynthesis 
(Lalitha et al . , Indian. J. Biochem. Biophys . 23: 249- 
253 (1986)). A mevalonate kinase gene from Arabidopsis 
thaliana has been cloned (GeneBank accession number 
X77793; Riou et al . , Gene 148: 293-297 (1994)). 

Phosphomevalonate kinase (EC 2.7.4.2) (MVAP 
kinase) is an enzyme associated with isoprene and 
ergosterol biosynthesis that converts mevalonate-5 - 
phosphate to mevalonate-5 -pyrophosphate utilizing ATP 
(Tsay et al . , Mol . Cell. Biol. 11: 620-631 (1991)). 

Mevalonate pyrophosphate decarboxylase ("MVAPP 
decarboxylase") (EC 4.1.1.33) catalyzes the conversion 
of mevalonate pyrophosphate to isopentenyl diphosphate 
("IPP") . The reaction is reported to be a 
decarboxylat ion/dehydration reaction which hydrolyzes 
ATP and requires Mg 2 +. a cDNA encoding Arabidopsis 
thaliana MVAPP decarboxylase has been isolated (Toth et 
al., J". Biol. Chem. 271: 7895-7898 (1996)). An 
isolated Arabidopsis thaliana MVAPP decarboxylase gene 
was reported to be able to complement the yeast MVAPP 
decarboxylase . 

Isopentenyl diphosphate isomerase ( " IPP : DMAPP" ) 
(EC 5.3.3.2) catalyzes the formation of dimethylallyl 
pyrophosphate (DMAPP) from isopentenyl pyrophosphate 
(IPP) . Plant IPP: DMAPP isomerase gene sequences have 
been reported for this enzyme. It has also been 
reported that IPP:DMAPP isomerase is involved in rubber 
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biosynthesis in a latex extract from Hevea (Tangpakdee 
et al., Phytochemi s try 4 5 : 261-267 (1997). 

Farnesyl pyrophosphate synthase (EC 2.5.1.1) is a 
prenyltransf erase which has been reported to play a 
5 role in providing polyisoprenoids for sterol compound 
biosynthesis as well as a number of other pathways (Li 
et al., Gene 17: 193-196 (1996)). Farnesyl 
pyrophosphate synthase combines DMAPP with IPP to yield 
geranyl pyrophosphate ("GPP"). The same enzyme 

10 condenses GPP with a second molecule of IPP to produce 
farnesyl pyrophosphate ("FPP"). FPP is a molecule that 
can proceed down the pathway to sterol compound 
synthesis, or that can be shuttled through other 
pathways leading to the synthesis of quinones or 

15 sesquiterpenes. 

Squalene synthase (EC 2.5.1.21) reductively 

condenses two molecules of FPP in the presence of Mg 2+ 
and NADPH to form squalene. The reaction involves a 
head-to-head condensation, and forms a stable 

20 intermediate, presqualene diphosphate. The enzyme is 
subject to sterol demand regulation similar to that of 
HMGCoA reductase. The activity of squalene synthase 
has been reported to have a regulatory effect on the 
incorporation of FPP into sterol and other compounds 

25 for which it serves as a precursor (Devarenne et al . , 
Arch. Biochem. Biophys . 349: 205-215 (1998)). 

Squalene epoxidase (EC 1.14.99.7) (also called 
squalene monooxygenase) catalyzes the conversion of 
squalene to squalene epoxide (2 , 3 -oxidosqualene) , a 

30 precursor to the initial sterol molecule in the sterol 
compound biosynthetic pathway, cycloartenol . This is 
the first reported step in the pathway where oxygen is 
required for activity. The formation of squalene 



47 MTC 67 83.1 

PATENT 

epoxide is also the last common reported step in sterol 
biosynthesis of animals, fungi, and plants. 

The later pathway of sterol compound biosynthetic 
steps starts with the cyclization of squalene epoxide 
5 and ends with the formation of 5 -24 -alky 1 sterols in 
plants . 

2 , 3 -oxidosqualene cycloartenol cyclase (EC 
5.4.99.8) (also called cycloartenol synthase) is the 
first step in the sterol compound pathway that is 

10 plant-specific. The cyclization of 2 , 3 -oxidosqualene 
leads to lanosterol in animals and fungi, while in 
plants the product is cycloartenol. Cycloartenol 
contains a 9 , 19 -eye lop ropy 1 ring. The cyclization is 
reported to proceed from the epoxy end in a chair-boat - 

15 chair-boat sequence that is mediated by a transient C- 
2 0 carbocat ionic intermediate. 

S-adenosyl-L-methionine : sterol C-24 methyl 
transferase ("SMT1") (EC 2.1.1.41) catalyzes the 
transfer of a methyl group from S-adenosyl-L-methionine 

20 to the C-24 center of the sterol side chain (Nes et al . 
(1991) J". Biol. Chem. 2 66 (23 ): 15202 -15212 ) . This is 
the first of two methyl transfer reactions that have 
been reported to be an obligatory and rate- limiting 
step of the sterol compound -producing pathway in 

25 plants. The second methyl transfer reaction occurs 
later in the pathway after the A 8 " 7 isomerase. The 
enzyme responsible for the second methyl transfer 
reaction is named SMTII (Bouvier-Nave , P. et al., 
(1997) Eur. J. Biochem. , 246: 518-529). An isoform, 

30 SMTII, catalyzes the conversion of cycloartenol to a 
A 23<24) -24 -alky 1 sterol, cyclosadol (Guo et al . (1996) 
Tetrahed. Lett. 37 (38) : 6823-6826) . 
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Sterol C-4 demethylase catalyzes the first of 
several demethylation reactions, which results in the 
removal of the two methyl groups at C-4. While in 
animals and fungi the removal of the two C-4 methyl 
groups occurs consecutively, in plants it has been 
reported that there are other steps between the first 
and second C-4 demethylations . The C-4 demethylation 
is catalyzed by a complex of microsomal enzymes 
consisting of a monooxygenase , an NAD + -dependent sterol 
4 -decarboxylase, and an NADPH- dependent 3 -ketosteroid 
reductase . 

Cycloeucalenol-obtusif oliol isomerase ("COI") 
catalyzes the opening of the cyclopropyl ring at C-9. 
The opening of the cyclopropyl ring at C-9 creates a 
double bond at C-8. 

Sterol C-14 demethylase catalyzes demethylation at 
C-14, which removes the methyl group at C-14 and 
creates a double bond at that position. In both fungi 
and animals, this is the first step in the sterol 
synthesis pathway. Sterol 14 -demethylation is mediated 
by a cytochrome P-450 complex. 

Sterol C-14 reductase catalyzes a C-14 
demethylation that results in the formation of a double 
bond at C-14 (Ellis et al . , Gen. Microbiol . 137: 2627- 
2630 (1991)) . This double bond is removed by a A 14 
reductase. The normal substrate is 4 -methyl -8 , 14 , 24 
(24 1 ) -trien-3p-ol . NADPH is the normal reductant . 

Sterol C-8 isomerase catalyzes a reaction that 
involves further modification of the tetracyclic rings 
or the side chain (Duratti et al . , Biochem. Pharmacol. 
34: 2765-2777 (1985)). The kinetics of the sterol 
isomerase-catalyzed reaction favor a A 8 6 A 7 isomerase 
reaction that produces a A 7 group. 
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Sterol C-5 desaturase catalyzes the insertion of 
the A 5 -double bond that normally occurs at the A 7 -sterol 
level, thereby forming a A 5,7 - sterol (Parks et al . , 
Lipids 30: 227-230 (1995)). The reaction has been 
5 reported to involve the stereospecif ic removal of the 
5a and 6a hydrogen atoms, biosynthetically derived from 
the 4 pro-R and 5 pro-S hydrogens of the (+) and (-) R- 
mevalonic acid, respectively. The reaction is 
obligatorily aerobic, and requires NADPH or NADH . The 
10 desaturase has been reported to be a multienzyme 
complex present in microsomes. It consists of the 
desaturase itself, cytochrome and a pyridine 

nucleotide-dependent f lavoprotein . The A 5 -desaturase is 
reported to be a mono -oxygenase that utilizes electrons 
15 derived from a reduced pyridine nucleotide via 
cytochrome b5 . 

Sterol C-7 reductase catalyzes the reduction of a 
A 7 -double bond in A 5,7 -sterols to generate the 
corresponding A 5 -sterol. It has been reported that the 

20 mechanism involves, like many other sterol enzymes, the 
formation of a carbocat ionic intermediate via 
electrophilic "attack" by a proton. 

Sterol C-24(28) isomerase catalyzes the reduction 
of a A 24<28) -A 24 , a conversion that modifies the side 

25 chain. The product is a A 24 (25) -24 -alkyl sterol. Sterol 
C-24 reductase catalyzes the reduction of the 24(25) 
double bond at C-24, which produces sitosterol. 
Recently, Klahre et al . ((1998) Plant Cell 10:1677- 
1690) discovered that both the isomerization and 

30 reduction steps are catalyzed by an enzyme coded by the 
same gene , i.e., DIM/DWF1 . 
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Sterol C-22 desaturase {EC 2.7.3.9) catalyzes the 
formation of a double bond at C-22 on the side chain. 
This formation of a double bond at C-22 on the side 
chain marks the end of the sterol compound biosynthetic 
5 pathway, and results in the formation of stigmasterol 
(Benveniste (1986) Annu. Rev. Plant Physiol. 37:275- 
308) . The C-22 desaturase in yeast, which is the 
reported final step in the biosynthesis of ergosterol 
in that organism, requires NADPH and molecular oxygen. 

10 In addition, the reaction is also reported to involve a 
cytochrome P450 that is distinct from a cytochrome P450 
participating in demethylat ion reactions (Lees et al . 
(1995) Lipids 30: 221-226). 

Phytosterols are biogenetic precursors of 

15 brassinosteroids , steroid alkaloids, steroid 

sapogenins, ecdysteroids , and steroid hormones. This 
precursor role of phytosterols is often described as a 
"metabolic" function. A common transformation of free 
sterols in tissues of vascular plants is the 

20 conjugation at the 3 -hydroxy group of sterols with 

long-chain fatty acids to form steryl esters, or with a 
sugar, usually with a single molecule of [3-D-glucose , 
to form steryl glycosides. Some of the steryl 
glycosides are additionally esterified, at the 6- 

25 hydroxy group of the sugar moiety, with long-chain 
fatty acids to form acylated steryl glycosides . 

The existence of several enzymes that are 
specifically associated with the synthesis and 
breakdown of conjugated sterols has been reported 

30 (Woj ciechowski , Physiology and Biochemistry of Sterols, 
eds. Patterson, Nes, AOCS Press, 361 (1991)). Enzymes 
involved in this process include: UDPGlc : Sterol 
glucosyltransf erase , phospho (galacto) glyceride steryl 
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glucoside acyltransf erase, and sterylglycoside and 
sterylester hydrolases. 

UDPGlc : sterol glucosyltransf erase (EC 2.4.1.173) 
catalyzes glucosylation of phytosterols by glucose 
5 transfer from UDP-glucose ("UDPGl"). The formation of 
steryl glycosides can be measured using UDP- [ 14 C] glucose 
as the substrate. Despite certain differences in their 
specificity patterns, all reported UDPGlc : sterol 
glucosyltransf erases preferentially glucosylate only 

10 sterols or sterol-like molecules that contain a C-3 
hydroxy group, a p-conf iguration, and which exhibit a 
planar ring. It has been reported that UDPGlc : sterol 
glucosyltransf erases are localized in the microsomes. 
Phospho (galacto) glyceride steryl glucoside 

15 acyltransf erase catalyzes the formation of acylated 
steryl glycosides from the substrate steryl glycoside 
by transfer of acyl groups from some membranous polar 
acyllipids to steryl glycoside molecules. 

Acylglycerol : sterol acyltransf erase (EC 2.3.1.26) 

20 catalyzes the reaction wherein certain acylglycerols 

act as acyl donors in a phytosterol esterif ication . In 
plants, the activity of acylglycerol : sterol 
acyltransf erase is reported to be associated with 
membranous fractions. A pronounced specificity for 

25 shorter chain unsaturated fatty acids was reported for 
all acyltransf erase preparations studied in plants. 
For example, acylglycerol : sterol acyltransf erases from 
spinach leaves and mustard roots can esterify a number 
of phytosterols. 

30 Sterylglycoside and sterylester hydrolases ("SG- 

hydrolases") catalyze the enzymatic hydrolysis of 
sterylglycosides to form free sterols. The SG- 
hydrolase activity is not found in mature, ungerminated 
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seeds, is reported to emerge only after the third day 
of germination, and is found mainly in the cotyledons. 
It has been reported that phospho (galacto) glyceride : SG 
acyltranaf erase may catalyze a reversible reaction. 
5 Enzymatic hydrolysis of sterylesters in germinating 

seeds of mustard, barley and corn is reported to be low 
in dormant seeds, but increases during the first ten 
days of germination. This activity is consistent with a 
decrease in sterylesters and an increase in free 
10 sterols over the same temporal period. 



II. Processes for Modifying Steroid Compound 
Biosynthesis and Accumulation 

In order to obtain seed producing oil containing 

15 elevated levels of phytostanols and phytostanol esters 
such as sitostanol and sitostanol esters, these 
recombinant constructs or expression cassettes can be 
introduced into plant cells by any number of 
conventional means known in the art and regenerated 

20 into fertile transgenic plants. The genome of such 
plants can then comprise introduced DNA encoding 
various steroid pathway enzymes, alone or in 
combination, that achieves the desirable effect of 
enhancing the levels of phytostanols, phytostanol 

25 esters, mixtures thereof in the oil of seed thereof. 

Preferably, the genome can comprise introduced DNA 
encoding a HMG CoA reductase enzyme and an introduced 
DNA encoding one or more of a squalene epoxidase, a 
sterol methyl transferase I, a sterol C4 -demethylase , 

30 an obtusifoliol C14a-demethylase , a sterol C5- 

desaturase, a sterol methyl transferase II. In each 
case, the foregoing introduced DNAs can be operatively 
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linked to regulatory signals that cause seed-specific 
expression thereof. 

The present invention encompasses not only such 
transgenic plants, but also transformed plant cells, 
5 including cells and seed of such plants, as well as 
progeny of such plants, for example produced from the 
seed. Transformed plant cells and cells of the 
transgenic plants encompassed herein can be grown in 
culture for a time and under appropriate conditions to 

10 produce oil containing elevated levels of phytosterols 
and/or phytostanols and their corresponding esters. 
Alternatively, the phytosterols, phytostanols, and 
their corresponding esters can be isolated directly 
from the cultures. 

15 In addition, of course, seed obtained from the 

transgenic, progeny, hybrid, etc., plants disclosed 
herein can be used in methods for obtaining oil 
containing phytosterols, phytosterol esters, 
phytostanols, phytostanol esters, or mixtures thereof 

20 employing extraction and processing procedures known in 
the art. Note, in this regard, Kochhar (1983) Progr. 
Lipid Res. 22: 161-188. 

The present invention also encompasses a method of 
producing a plant that accumulates an elevated level of 

25 sitosterol, at least one sitosterol ester, sitostanol, 
at least one sitostanol ester, or mixtures thereof, in 
seeds thereof compared to seeds of a corresponding 
plant comprising no introduced DNA encoding a 
polypeptide or protein that affects the biosynthesis of 

30 sterols, phytosterols, phytosterol esters, 

phytostanols, phytostanol esters, or combinations 
thereof, comprising sexually crossing a transgenic 
plant of the present invention with such a 
corresponding plant. The latter can be a non- 
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transgenic plant, or a transgenic plant containing 
introduced DNA encoding a trait other than one 
affecting sterol, phytosterol, etc., biosynthesis. For 
example, such trait may be insect or herbicide 
5 resistance. Plants produced by this method also form 
part of the present invention. 

Also included are plants that accumulate an 
elevated level of sitosterol, at least one sitosterol 
ester, sitostanol, at least one sitostanol ester, or 

10 mixtures thereof, in seeds thereof compared to seeds of 
a corresponding plant comprising no introduced DNA 
encoding a polypeptide or protein that affects the 
biosynthesis of sterols, phytosterols , phytosterol 
esters, phytostanols, phytostanol esters, or 

15 combinations thereof, which are apomictic. 

A process of increasing the formation of steroid 
pathway products in a transformed host cell as compared 
to an otherwise identical non-transformed host cell 
comprising the following steps. A host cell is 

20 transformed with a recombinant vector comprising (a) as 
operably linked components in the 5' to 3' direction, a 
promoter, a DNA sequence encoding a polypeptide 
exhibiting 3 -hydroxy-3 -methylglutaryl -Coenzyme A 
reductase enzyme activity, and a transcription 

25 termination signal sequence; and (b) as operably linked 
components in the 5 ' to 3 ' direction, a promoter, a DNA 
sequence encoding a steroid pathway enzyme, and a 
transcription termination signal sequence. The steroid 
pathway enzyme is a squalene epoxidase enzyme, a sterol 

30 methyl transferase I enzyme, a sterol C4 -demethylase 
enzyme, a obtusifoliol C14a-demethylase enzyme, a 
sterol C5-desaturase enzyme, and a sterol methyl 
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transferase II enzyme. The transformed plant cell is 
regenerated into a transgenic plant. 

A plant contemplated by this invention is a 
vascular, multicellular higher plant. Such higher 
5 plants will hereinafter by usually referred to simply 
as "plants". Such "plants" include both complete 
entities having leaves, stems, seeds, roots and the 
like as well as callus and cell cultures that are 
monocotyledonous and dicotyledonous. Dicotyledonous 
10 plants are a preferred embodiment of the present 
invention . 

Preferred plants are members of the Solanaceae, 
Leguminosae, Ammiaceae, Brassicaceae , Gramineae, 
Carduaceae and Malvaceae families. Exemplary plant 

15 members of those families are tobacco, petunia and 

tomato (Solanaceae) , soybean and alfalfa (Leguminosae) , 
carrot (Ammiaceae) , corn, maize and barley (Gramineae) , 
Arabidopsis (Brassicaceae) , guayule (Carduaceae) , and 
cotton (Malvaceae) . A preferred plant is tobacco of 

20 the strain Nicotiana tabacum (N. Tabacum) , cotton of 
the strain Coker line 312 -5A, soybean of the strain 
Glycine max, alfalfa of the strain RYSI or tomato of 
the strain Lycopersicon esculentium. Other plants 
include canola, maize and rape. 

25 A transgenic plant contemplated by this invention 

is produced by transforming a plant cell or protoplast 
with an added, exogenous structural gene that encodes a 
polypeptide having HMG-CoA reductase activity and an 
exogenous structural gene that encodes at least one 

30 polypeptide have steroid pathway enzyme activity to 
produce a transformed plant cell, and regenerating a 
transgenic plant form the transformed plant cell . The 
encoded polypeptide is expressed both in the 
transformed plant cell or protoplast and the resulting 
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transgenic plant. (The phrase "plant cell" will 
hereinafter be used to include a plant protoplast, 
except where plant protoplasts are specifically 
discussed) . 

5 A non- transgenic plant that serves as the source 

of the plant cell that is transformed, i.e. the 
precursor cell, is referred to herein as a "native, 
non-transgenic" plant. The native, non-transgenic 
plant is of the same strain as the formed transgenic 
10 plant. 

Sterol production in a transgenic plant of the 
present invention is increased by increasing the 
activity of the enzyme HMG-CoA reductase, which enzyme 
catalyzes the conversion of 3 -hydroxy- 3 -methylglutaryl 

15 Coenzyme A (HMG-CoA) to mevalonate and the activity of 
at least one other steroid pathway enzyme . As used 
herein, the term "specific activity" means the activity 
normalized to cellular protein content. 

HMG-CoA reductase activity is increased by 

20 increasing the amount (copy number) of a gene encoding 
a polypeptide having HMG-CoA reductase catalytic 
activity. Expression of the increased amount of that 
encoded structural gene enhances the activity of that 
enzyme . 

25 The amount of the expressed gene is increased by 

transforming a plant cell with a recombinant DNA 
molecule comprising a vector operatively linked to a 
DNA segment that encodes a polypeptide having HMG-CoA 
reductase activitiy, and a promoter suitable for 

30 driving the expression of that polypeptide in that 
plant cell, and culturing the transformed plant cell 
into a transgenic plant. Such a polypeptide includes 
intact as well as a catalyt ically active, truncated 
HMG-CoA reductase proteins. 



57 MTC 6783.1 

PATENT 

Thus, a transformed plant cell and a transgenic 
plant have one or more added, exogenous genes that 
encode a polypeptide having HMG-CoA reductase activity 
and at least one other steroid pathway enzyme activty 
5 relative to a native, non- transgenic plant or 

untrans formed plant cell of the same type. As such, a 
transformed plant cell or transgenic plant can be 
distinguished from an untransf ormed plant cell or 
native, nontransgenic plant by standard technology such 

10 as agarose separation of DNA fragments or mRNAs 

followed by transfer and appropriate blotting with DNA 
or RNA, e.g., Southern or Northern blotting, or by use 
of polymerase chain reaction technology, as are well 
known. Relative HMG-CoA reductase activity of the 

15 transformed cell or transgenic plant with untransf ormed 
cells and native, non-transgenic plants or cell 
cultures therefrom can also be compared, with a 
relative activity for that enzyme of about 1.5:1 for 
transgenic (transformed) to native (untransf ormed) 

20 showing transformation. Higher relative activity 
ratios such as about 15:1 have also been observed. 

Sterol accumulation can also be used to 
distinguish between native, non-transgenic and 
transgenic plants. A transgenic plant has at least 

25 about twice the total sterol content as a native, non- 
transgenic plant where a single added gene is present. 
Greater differences up to about forty- fold have also 
been observed. 

Sitostanol, sitostanol ester, and tocopherol 

30 biosynthesis and accumulation in plants can be modified 
in accordance with the present invention by variously 
expressing the nucleic acid coding sequences discussed 
above, alone or in combination, as described herein. 
The expression of sequences encoding sterol 
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methyltransferase II enzymes facilitates the production 
of plants in which the biosynthesis and accumulation of 
campesterol, campestanol, and their esters can be 
reduced as these enzymes shunt sterol intermediates 
5 away from campesterol, and toward sitosterol and 
sitostanol . 

Ill . DNA Encoding Useful Polypeptides 

The present invention contemplates a recombinant 

10 construct or a recombinant vector that contains a DNA 

sequence encoding a polypeptide exhibiting 3-hydroxy-3- 
methylglutaryl -Coenzyme A (HMG-CoA) reductase activity 
and a DNA sequence encoding a polypeptide exhibiting 
the activity of a steroid pathway enzyme. Each 

15 polypeptide-encoding DNA sequence is operably linked in 
the 5 1 to 3 ' direction independent of the other 
sequence. Each DNA sequence in the 5' to 3' direction 
comprises a promoter, then the DNA sequence encoding 
the polypeptide then a transcription termination signal 

20 sequence. The steroid pathway enzyme is a squalene 

epoxidase enzyme, a sterol methyl transferase I enzyme, 
a sterol C4 -demethylase enzyme, a obtusifoliol C14a- 
demethylase enzyme, a sterol C5-desaturase enzyme, or a 
sterol methyl transferase II enzyme. It is 

25 contemplated that HMG-CoA reductase and steroid pathway 
enzyme activities come from a mutant or truncated form 
of those enzymes, such as a truncated HMG-CoA reductase 
lacking the transmembrane region while retaining a 
functional catalytic domain. Several HMG CoA reductase 

30 sequences are known in the art. An amino acid 

alignment for these is shown in FIG. 32. The sources 
of the sequences used in building the multiple 
alignment are listed in Table 5. 
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Table 5. Sources of Sequences Used In Building 
The Multiple Alignment 
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methanococ 

halobacter 

sulfolobus 

yeast2 
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swissprot:hmdh_orysa 
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swissprot:h md h_nicsy 
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sp_plant:q43826 
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swissprot:hmdh_crigr 
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swissprot:hmdh_rat 

swissprot:hmdh_rabit 

gp_pri2:humhmgcoa_1 

gp_rod:mushmgcoa_1 

swissprot:hmdh_xenla 

swissprot:hmdh_strpu 

swissprot:h md hblage 

swissprot:hmdh_drome 

swissprot:hmd1_dicdi 

swissprot:hmdh_schma 

swissprotih mdti_arcfu 

gp_bct1 :psehmgcoa_1 
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026662 methanobacteriiim thermoautotrophicur 
Q58116 methanococcus jannaschii. 3-hydroxy-3 
Q59468 halobacterium volcanii (haloferax volcar 
008424 sulfolobus solfataricus. 3-hydroxy-3-me 
M22255 Saccharomyces cerevisiae Yeast HMG 
M22002 Saccharomyces cerevisiae Yeast HMG 
Q12649 phycomyces blakesleeanus. 3-hydroxy- 
Q12577 fusarium moniiiforme (gibberella fujikurc 
AB012603 Candida utilis Candida utilis HMG mF 
P34136 dictyostelium discoideum (slime mold). : 
hydroxymethylglutaryl-CoA reductase (NADPH) 
P4801 9 oryza sativa (rice). 3-hydroxy-3-methylg 
024594 zea mays (maize). 3-hydroxy-3-methylg 
hydroxymethylglutaryl-CoA reductase (NADPH) 
hydroxymethylglutaryl-CoA reductase (NADPH) 
from proprietary soy sequence database 
Q00583 hevea brasiliensis (para rubber tree). 3- 
Q031 63 catharanthus roseus (rosy periwinkle) (r 
P48022 lycopersicon esculentum (tomato). 3-hyi 
Q01 559 nicotiana sylvestris (wood tobacco). 3-h 
L01400 Solanum tuberosum Potato hydroxymetl 
Q43826 raphanus sativus (radish), hydroxymeth 
L19261 Arabidopsis thaliana Arabidopsis thalian 
AB021862 Cucumis melo Cucumis melo mRNA 
P29058 hevea brasiliensis (para rubber tree). 3-I 
P29057 hevea brasiliensis (para rubber tree). 3-I 
P48021 camptotheca acuminata. 3-hydroxy-3-m 
P43256 arabidopsis thaliana (mouse-ear cress). 
P00347 cricetulus griseus (Chinese hamster). 3-I 
L00183 Cricetulus sp. Hamster 3-hydroxy-3-met 
M12705 Mesocricetus auratus Syrian hamster 3 
P51639 rattus norvegicus (rat). 3-hydroxy-3-met 
Q29512 oryctolagus cuniculus (rabbit). 3-hydrox 
M1 1058 Homo sapiens Human 3-hydroxy-3-met 
M62766 Mus musculus Mouse HMG-CoA reduc 
P2071 5 xenopus laevis (african clawed frog). 3-I 
P16393 strongylocentrotus purpuratus (purple si 
P54960 blattella germanica (german cockroach) 
P14773 drosophila melanogaster (fruit fly). 3-hyc 
P341 35 dictyostelium discoideum (slime moid). ; 
P16237 schistosoma mansoni (blood fluke). 3-hi 
028538 archaeoglobusfulgidus. 3-hydroxy-3-mi 
M24015 Pseudomonas mevalonii P.mevalonii HI 



10 



These sequences, and their truncated counterparts, 
are useful in the present invention. Examples of such 
preferred HMG CoA reductases include the truncated 
rubber and Arabidopsis HMG CoA reductases disclosed 
herein. 

Other enzyme -encoding DNAs can be introduced into 
plants to elevate even further the levels of desirable 
A5 sterols and their reduced stanol counterparts as 



well as other phytosterols and tocopherols. Thus, the 
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DNA sequences contemplated for use in the present 
invention, which can be used alone or in various 
combinations as discussed below, include, but are not 
limited to, those encoding the following enzymes: 3- 

5 hydroxys tero id oxidases; steroid Sreductases ; sterol 
methyltransf erases; sterol acyltransf erases ; and S- 
adenosylmethionine- dependent a- tocopherol 
methyltransf erases . 

In each case, the sequences encoding these enzymes 

10 can comprise an expression cassette comprising, 

operably linked in the 5' to 3 'direction, a seed- 
specific promoter, the enzyme coding sequence, and a 
transcriptional termination signal sequence functional 
in a plant cell such that the enzyme is successfully 

15 expressed. For use in the methods disclosed herein, 

the recombinant constructs or expression cassettes can 
be incorporated in a vector, for example a plant 
expression vector. Such vectors can be transformed 
into host cells such as bacterial cells, for example 

20 during the preparation or modification of the 

recombinant constructs, and plant cells. Thus, the 
invention encompasses plants and seeds comprising such 
transformed plant cells. 

It will be apparent to those of skill in the art 

25 that the nucleic acid sequences set forth herein, 

either explicitly, as in the case of the sequences set 
forth above, or implicitly with respect to nucleic acid 
sequences generally known and not present herein, can 
be modified due to the built-in redundancy of the 

30 genetic code and noncritical areas of the polypeptide 
that are subject to modification and alteration. In 
this regard, the present invention contemplates allelic 
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variants of structural genes encoding a polypeptide 
having HMG-CoA reductase activity. 

The previously described DNA segments are noted as 
having a minimal length, as well as total overall 
5 length. That minimal length defines the length of a 

DNA segment having a sequence that encodes a particular 
polypeptide having HMG-CoA reductase activity. As is 
well known in the art, as long as the required DNA 
sequence is present (including start and stop signals) , 

10 additional base pairs can be present at either end of 
the segment and that segment can still be utilized to 
express the protein. This, of course, presumes the 
absence in the segment of an operatively linked DNA 
sequence that represses expression, expresses a further 

15 product that consumes the enzyme desired to be 

expressed, expresses a product other than the desired 
enzyme or otherwise interferes with the structural gene 
of the DNA segment . 

Thus, as long as the DNA segment is free of such 

20 interfering DNA sequences, a DNA segment of the 

invention can be up to 15,000 base pairs in length. 
The maximum size of a recombinant DNA molecule, 
particularly a plant integrating vector, is governed 
mostly by convenience and the vector size that can be 

25 accommodated by a host cell, once all of the minimal 
DNA sequences required for replication and expression, 
when desired, are present. Minimal vector sizes are 
well known. 

Also encompassed by the present invention are 
30 nucleotide sequences biologically functionally 

equivalent to those disclosed herein, that encode 
conservative amino acid changes within the amino acid 
sequences of the presently disclosed enzymes, producing 
"silent" changes therein. Such nucleotide sequences 
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contain corresponding base substitutions based upon the 
genetic code compared to the nucleotide sequences 
encoding the presently disclosed enzymes. Substitutes 
for an amino acid within the enzyme sequences disclosed 
5 herein is selected from other members of the class to 
which the naturally occurring amino acid belongs. 
Amino acids can be divided into the following four 
groups: (1) acidic amino acids; (2) basic amino acids; 
(3) neutral polar amino acids; and (4) neutral non- 
10 polar amino acids. Representative amino acids within 
these various groups include, but are not limited to: 
(1) acidic (negatively charged) amino acids such as 
aspartic acid and glutamic acid; (2) basic (positively 
charged) amino acids such as arginine, histidine, and 
15 lysine; (3) neutral polar amino acids such as glycine, 
serine, threonine, cysteine, cystine, tyrosine, 
asparagine, and glutamine; and (4) neutral nonpolar 
(hydrophobic) amino acids such as alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan, 
20 and methionine . 

A. HMG-CoA Reductase 

The introduction of an HMG CoA reductase gene into 
a cell results in a higher carbon throughput through 

25 the steroid synthesis pathway. The introduction of a 
truncated HMG CoA reductase gene (lacking the 
transmembrane region, resulting in a soluble HMG CoA 
reductase enzyme) provides higher HMG CoA reductase 
activity and thus increased delta- 5 steroid compound 

30 production over the same case with an introduced full- 
length HMG CoA reductase gene. A useful truncated HMG 
CoA reductase nucleic acid encodes at least the 
catalytic domain. 



63 MTC 67 8 3.1 

PATENT 

Hydroxymethylglutaryl-CoA reductase is enzyme 
number 1.1.1.88, using the recommended nomenclature of 
the International Union of Biochemistry and Molecular 
Biology on the Nomenclature and Classification of 
5 Enzymes, Enzyme Nomenclature 1992 , Edwin C. Webb, ed., 
Academic Press, Inc. (San Diego, California: 1992), 
page 35. 

The present invention contemplates transforming a 
plant cell with a structural gene that encodes a 

10 polypeptide having HMG-CoA reductase activity. The 
HMG-CoA reductase enzymes of both animal and yeast 
cells comprise three distinct amino acid residue 
sequence regions, which regions are designated the 
catalytic region, the membrane -binding region and the 

15 linker region. 

The catalytic region contains the active site of 
the HMG-CoA reductase enzyme and comprises about forty 
percent of the COOH- terminal portion of intact HMG-CoA 
reductase enzyme . 

20 The membrane -binding region contains hydrophobic 

amino acid residues and comprises about fifty percent 
of the NH2 -terminal portion of intact HMG-CoA reductase 

enzyme . 

The linker region connects the catalytic and 
25 membrane -binding regions, and constitutes the remaining 
about ten percent of the intact enzyme. 

As discussed in greater detail below, only the 
catalytic region of HMG-CoA reductase is needed herein 
to provide the desired enzyme activity. Thus, an 
30 exogenous structural gene that encodes a polypeptide 
corresponding to that catalytic region is the minimal 
HMG Co A reductase gene required for transforming plant 
cells in addition to one of the steroid pathway enzymes 
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discussed below. The present invention therefore 
contemplates use of both intact and truncated 
structural genes that encode a polypeptide having HMG- 
CoA reductase activity. 
5 A structural gene encoding a polypeptide having 

HMG-CoA reductase activity can be obtained or 
constructed from a variety of sources and by a variety 
of methodologies. See, e.g. Carlson et al . , Cell, 
28:145 (1982); Rine et al . , Proc . Natl. Acad. Sci. USA, 

10 80:6750 (1983) . Exemplary of such structural genes are 
the mammalian and yeast genes encoding HMG-CoA 
reductase or the catalytic region thereof. 

The disclosures of Chappell, et al . , U.S. Patent 
No. 5,349,126, are incorporated in full herein by 

15 reference. The mammalian genome contains a single gene 
encoding HMG-CoA reductase. The nucleotide sequence of 
the hamster and human gene for HMG-CoA reductase have 
been described in Chappell et al . A composite 
nucleotide sequence of DNA corresponds to the mRNA SEQ 

20 ID NO : 1 of Chappell et al . , as well as the derived 

amino acid residue sequence SEQ ID NO : 2 of Chappell et 
al . , for hamster HMG-CoA reductase is provided in Fig. 
2 of Chappell et al , reprinted from Chin et al . , 
.Nature, 308:613 (1984). The composite nucleotide 

25 sequence of Fig. 2, SEQ ID NO : 1 of Chappell et al . , 
comprising about 4768 base pairs, includes the 
nucleotide sequence encoding the intact hamster HMG-CoA 
reductase enzyme . 

Intact hamster HMG-CoA reductase comprises about 

30 887 amino acid residues ( SEQ ID NO : 2 of Chappell et 
al . ) . A structural gene encoding an intact hamster 
HMG-CoA reductase enzyme of 887 amino acid residues 
comprises base pairs from about nucleotide position 164 
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to about nucleotide position 2824 of SEQ ID N0:1 of 
Chappell et al . 

A preferred structural gene is one that encodes a 
polypeptide corresponding to only the catalytic region 

5 of the enzyme. Two catalytically active segments of 

hamster HMG-CoA reductase have been defined. Liscura et 
al . , J. Biol. Chem. , 260(1): 522 (1985). One segment 
containing a catalytic region has an apparent molecular 
weight of 62 kDa and comprises amino acid residues from 

10 about position 373 to about position 887. A second 
segment containing a catalytic region has an apparent 
molecular weight of 53 kDa segment and comprises amino 
acid residues from about position 460 to about position 
887. The 62 kDa catalytically active segment is 

15 encoded by base pairs from about nucleotide position 
1280 to about nucleotide position 2824 of SEQ ID NO : 1 
of Chappell et al . The 53 kDa catalytically active 
segment is encoded by base pairs from about nucleotide 
position 1541 to about nucleotide position 2824 of SEQ 

20 ID NO:l of Chappell et al . 

In a preferred embodiment, the utilized structural 
gene encodes the catalytic region and at least a 
portion of the linker region of HMG-CoA reductase. The 
linker region of hamster HMG-CoA reductase comprises 

25 amino acid residues from about position 34 0 to about 
position 373 or from about position 340 to about 
position 460, depending upon how the catalytic region 
is defined. These linker regions are encoded by base 
pairs from about nucleotide position 1180 to about 

30 nucleotide position 12 83 or from about position 1180 to 
about position 1540, respectively of SEQ ID NO:l of 
Chappell et al . The structural gene encoding the 
linker region is operatively linked to the structural 
gene encoding the catalytic region. 
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In one particularly preferred embodiment, a 
structural gene encoding a catalytically active, 
truncated HMG-CoA reductase enzyme can optionally 
contain base pairs encoding a small portion of the 
5 membrane region of the enzyme. 

A structural gene encoding a polypeptide 
comprising a catalytically active, truncated or intact 
HMG-CoA reductase enzyme from other organisms such as 
yeast can also be used in accordance with the present 

10 invention. 

Yeast cells contain two genes encoding HMG-CoA 
reductase. The two yeast genes, designated HMG1 and 
HMG2 , encode two distinct forms of HMG-CoA reductase, 
designated HMG-CoA reductase 1 SEQ ID NO: 3 of Chappell 

15 et al . are presented in Fig. 3 of Chappell et al . , are 
taken from Basson et al . Mol . Cell Biol., 8(9):3797 
(1988) . The nucleotide base sequences of HMG2 SEQ ID 
NO: 5 of Chappell et al . as well as the amino acid 
residue sequence of HMG-CoA reductase 2 SEQ ID NO : 6 of 

20 Chappell et al . are set forth therein in the Sequence 
Listing , 

The entire HMG1 gene comprises about 33 60 base 
pairs SEQ ID NO : 3 of Chappell et al . Intact HMG-CoA 
reductase 1 comprises an amino acid sequence of about 

25 1054 amino acid residues SEQ ID NO : 4 of Chappell et al . 
Thus, the minimal portion of the HMG1 gene that encodes 
an intact enzyme comprises base pairs from about 
nucleotide position 121 to about position 3282 of Fig. 
3, SEQ ID NO: 3 of Chappell et al . 

30 The entire HMG2 gene comprises about 334 8 base 

pairs SEQ ID NO : 5 of Chappell et al . Intact HMG-CoA 
reductase 2 comprises about 1045 amino acid residues 
SEQ ID NO: 6 of Chappell et al . Thus, the minimal 
portion of HMG2 gene that encodes intact HMG-CoA 
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reductase 2 comprises base pairs from about nucleotide 
position 121 to about position 3255 of SEQ ID NO : 5 of 
Chappell et al . 

By analogy to the truncated hamster structural 
5 gene, structural genes encoding polypeptides comprising 
catalytically active, truncated HMG-CoA reductase 
enzymes from yeast can also be used in accordance with 
the present invention. 

The catalytic region of HMG-CoA reductase 1 

10 comprises amino acid residues from about residue 618 to 
about reside 1054: i.e., the COOH- terminus . A 
structural gene that encodes the catalytic region 
comprises base pairs from about nucleotide position 
1974 to about position 3282 of Fig. 3 of Chappell et 

15 al . 

The linker region of HMG-CoA reductase 1 comprises 
an amino acid sequence from about residue 525 to about 
residue 617. A structural gene that encodes the linker 
region comprises nucleotides from about position 1695 

20 to about position 1973 of Fig. 3 of Chappell et al . A 
structural gene encoding the linker region of the 
enzyme operatively linked to the structural gene 
encoding the catalytic region of the enzyme. 

Also by analogy to the truncated hamster gene, a 

25 truncated HMG1 gene can optionally contain nucleotide 
base pair sequences encoding a small portion of the 
membrane -binding region of the enzyme. Such a 
structural gene preferably comprises base pairs from 
about nucleotide position 121 to about position 147 and 

30 from about position 1695 to about position 3282 of Fig. 
3 of Chappell et al . 

A construct similar to those above from an 
analogous portion of yeast HMG-CoA reductase 2 can also 
be utilized. 
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A nucleic acid sequence encoding HMG-CoA reductase 
from Hevea brasiliensis has been disclosed by Chye et 
al. (1991) Plant Mol . Biol. 16: 567-577. A nucleic 
acid sequence encoding an Arabidopsis thaliana HMG-CoA 
5 reductase has been published by Caelles et al . (1989) 
Plant Mol. Biol. 13: 627-638, and is also available as 
GenBank accession number L192 61. U.S. Patents Nos . 
5,306,862 and 5,365,017 disclose additional DNA 
sequences encoding HMG-CoA reductases. 
10 The following sequences are listed by Genbank 

Accession numbers: 

026662 methanobacterium the rmoaut ot r ophi cum . 3- 
hydroxy- 3 -methylglutaryl- coenzyme a reductase (ec 
1.1.1.34) (hmg-coa reductase). 12/1998 
15 Q58116 methanococcus jannaschii. 3 -hydroxy- 3 - 

methylglutaryl -coenzyme a reductase (ec '1 . 1 . 1 . 34) (hmg- 
coa reductase) . 7/1998 

Q59468 halobacterium volcanii (haloferax 
volcanii) . 3 -hydroxy- 3 -methylglutaryl -coenzyme a 
20 reductase (ec 1.1.1.34) (hmg-coa reductase) . 7/1998 

008424 sulfolobus solf ataricus . 3-hydroxy-3- 
methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg- 
coa reductase) . 12/1998 

M22255 Saccharomyces cerevisiae Yeast HMG-CoA 
25 reductase (HGM2) gene, complete cds; 3 -hydroxy- 3 -methyl 
glutaryl coenzyme A reductase. 4/1993 

M22 002 Saccharomyces cerevisiae Yeast HMG-CoA 
reductase (HGM1) gene, complete cds; 3-hydroxy-3- 
methyl -glutaryl coenzyme A reductase. 4/1993 
30 Q12 64 9 phycomyces blakesleeanus . 3-hydroxy-3- 

methylglutaryl- coenzyme a reductase (ec 1.1.1.34) (hmg- 
coa reductase) (fragment) . 11/1997 



69 MTC 67 83.1 

PATENT 

Q12577 fusarium monilif orme (gibberella 
fujikuroi). 3 -hydroxy- 3 -methylglutaryl- coenzyme a 
reductase (ec 1.1.1.34) (hmg- coa reductase). 11/1997 

AB012603 Candida utilis Candida utilis HMG mRNA 
5 for HMG-CoA reductase, complete cds . 7/1998 

P34136 dictyostelium discoideum (slime mold) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase 2 (ec 
1.1.1.34) (hmg- coa reductase 2) (fragment) .35735 

PQ0 761 hydroxymethylglutaryl-CoA reductase (NADPH) 
10 (EC 1.1.1.34) (HMGR 10) - wheat (fragment) 

P48019 oryza sativa (rice) . 3-hydroxy-3- 
methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg- 
coa reductase) (fragment) . 2/1996 

024 5 94 zea mays (maize) . 3 -hydroxy- 3- 
15 methylglutaryl coenzyme a reductase (ec 1.1.1.88) . 
5/1999 

PQ0763 hydroxymethylglutaryl-CoA reductase (NADPH) 
(EC 1.1.1.34) (HMGR 23) - wheat (fragment) 

PQ0 7 62 hydroxymethylglutaryl-CoA reductase (NADPH) 
20 (EC 1.1.1.34) (HMGR 18) - wheat (fragment) 

from proprietary soy sequence database 

Q00583 hevea brasiliensis (para rubber tree) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase 3 (ec 
1.1.1.34) (hmg- coa reductase 3). 7/1998 
25 Q03163 catharanthus roseus (rosy periwinkle) 

(madagascar periwinkle). 3 -hydroxy-3 -methylglutaryl - 
coenzyme a reductase (ec 1.1.1.34) (hmg-coa reductase). 
7/1998 

P48022 lycopersicon esculentum (tomato) . 3- 
30 hydroxy-3 -methylglutaryl -coenzyme a reductase 2 (ec 
1.1.1.34) (hmg- coa reductase 2). 7/1998 

Q01559 nicotiana sylvestris (wood tobacco) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase (ec 
1.1.1.34) (hmg-coa reductase). 7/1998 
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L014 0 0 Solanum tuberosum Potato 
hydroxymethylglutaryl coenzyme A reductase (hmgr) mRNA, 
complete cds; putative. 4/1996 

Q43 82 6 raphanus sativus (radish) . 
5 hydroxymethylglutaryl -coa reductase (ec 1.1.1.34) 
( hydroxymethylglutaryl -coa reductase (nadph) ) (3- 
hydroxy- 3 -methylglutaryl -coenzyme a red 

L19261 Arabidopsis thaliana Arabidopsis thaliana 
HMG-cOA reductase gene, complete cds. 4/1994 
10 AB0218 62 Cucumis melo Cucumis melo mRNA for HMG- 

CoA reductase, complete cds; putative. 1/1999 

P29058 hevea brasiliensis (para rubber tree) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase 2 (ec 
1.1.1.34) (hmg- coa reductase 2) (f ragment) . 35735 
15 P2 9057 hevea brasiliensis (para rubber tree) . 3- 

hydroxy- 3 -methylglutaryl -coenzyme a reductase 1 (ec 
1.1.1.34) (hmg- coa reductase 1). 7/1998 

P48021 camptotheca acuminata. 3-hydroxy-3- 
methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg- 
20 coa reductase) . 11/1997 

P43256 arabidopsis thaliana (mouse-ear cress) . 3- 
hydroxy-3 -methylglutaryl -coenzyme a reductase 2 (ec 
1.1.1.34) (hmg- coa reductase 2) (hmgr2). 7/1998 

P00347 cricetulus griseus (Chinese hamster) . 3- 
25 hydroxy- 3 -methylglutaryl -coenzyme A reductase (ec 
1.1.1.34) (hmg-coA reductase). 11/1997 

L00183 Cricetulus sp . Hamster 3-hydroxy-3- 
methylglutaryl coenzyme A (HMG CoA) reductase gene, 
exons 19 and 2 0; 3 -hydroxy- 3 -methylglutaryl coenzyme A 
30 (HMG CoA) . 4/1993 

M12705 Mesocricetus auratus Syrian hamster 3- 
hydroxy-3 -methylglutaral coenzyme A reductase (HMG- CoA 
reductase) mRNA, complete cds; 3-hydroxy-3- 
methylglutaral coenzyme A red 
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P51639 rattus norvegicus (rat) . 3 -hydroxy-3 - 
methylglutaryl- coenzyme a reductase (ec 1.1.1.34) (hmg- 
coa reductase) . 12/1998 

Q29512 oryctolagus cuniculus (rabbit) . 3-hydroxy- 
5 3 -methylglutaryl -coenzyme a reductase (ec 1.1.1.34) 
(hmg-coa reductase) . 7/1999 

M11058 Homo sapiens Human 3 -hydroxy-3 - 
methylglutaryl coenzyme A reductase mRNA, complete cds; 
3 -hydroxy- 3 -methylglutaryl coenzyme A reductase.ll/1994 
10 M62766 Mus musculus Mouse HMG-CoA reductase mRNA, 

3' end. 4/1993 

P20715 xenopus laevis (african clawed frog) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase (ec 
1.1.1.34) (hmg-coa reductase). 11/1997 
15 P163 93 strongylocentrotus purpuratus (purple sea 

urchin) . 3 -hydroxy-3 -methylglutaryl -coenzyme a 
reductase (ec 1.1.1.34) (hmg-coa reductase). 11/1997 

P54960 blattella germanica (german cockroach). 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase (ec 
20 1.1.1.34) (hmg-coa reductase). 11/1997 

P14773 drosophila melanogaster (fruit fly) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase (ec 
1.1.1.34) (hmg-coa reductase). 12/1998 

P34135 dictyostelium discoideum (slime mold) . 3- 
25 hydroxy-3 -methylglutaryl -coenzyme a reductase 1 (ec 
1.1.1.34) (hmg- coa reductase 1). 11/1997 

P16237 schistosoma mansoni (blood fluke) . 3- 
hydroxy- 3 -methylglutaryl -coenzyme a reductase (ec 
1.1.1.34) (hmg-coa reductase). 7/1998 
30 028538 archaeoglobus f ulgidus . 3 -hydroxy-3 - 

methylglutaryl -coenzyme a reductase (ec 1.1.1.34) (hmg- 
coa reductase) . 12/1998 
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M24015 Pseudomonas mevalonii P.mevalonii HMG-CoA 
reductase (mvaA) gene, complete cds; HMG-CoA reductase 
(EC 1.1.1.88) . 4/1993 

B . Steroid Pathway Enzymes 

The present invention contemplates nucleic acid 
sequences encoding polypeptides having the enzyme 
activity of the steroid pathway enzymes squalene 
epoxidase, sterol methyl transferase I, sterol C4- 
demethylase, obtusifoliol C14a-demethylase, sterol C5- 
desaturase and sterol methyl transferase II. 

i . Squalene Epoxidase 

Squalene epoxidase (also called squalene 
monooxygenase) catalyzes the conversion of squalene to 
squalene epoxide (2 , 3 -oxidosqualene) , a precursor to 
the initial sterol molecule in phytosterol biosynthetic 
pathway, cycloartenol . This is the first reported step 
in the pathway where oxygen is required for activity. 
The formation of squalene epoxide is also the last 
common reported step in sterol biosynthesis of animals, 
fungi and plants. Recently, several homologues of 
Ara.hid.ops is and Brassica squalene epoxidase genes were 
reported (Schafer, U.A. , Reed, D.W., Hunter, D.G., Yao, 
K. , Weninger, A.M., Tsang, E.W., Reaney, M.J., 
MacKenzie, S.L., and Covello, P.S. (1999). Plant Mol . 
Biol. 39(4): 721-728). The same authors also have a PCT 
application disclosing the use of antisense technology 
with squalene epoxidase to elevate squalene levels in 
plants (WO 97/34003) . However, to date there are no 
reports on functional characterization of any plant 
squalene epoxidase gene or enzyme. 
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Squalene Epoxidase, also known as squalene 
monooxygenase is enzyme reference number 1.14.99.7, 
Enzyme Nomenclature 1992 , p. 146. 

Several squalene epoxidase enzymes are known to 
5 the art. These include Arabidopsis squalene epoxidase 
protein sequence Accession No. AC004786 (SEQ ID N0:1), 
Arabidopsis squalene epoxidase Accession No. N64916 
(SEQ ID NO: 2) , and Arabidopsis squalene epoxidase 
Accession No. T44667 (SEQ ID N0:3). Japanese patent 
10 application No. 07194381 A discloses a DNA encoding a 
mammalian squalene epoxidase. 

In order to facilitate the modifications to sterol 
biosynthesis and accumulation described herein, the 
present invention also provides an isolated DNA 
15 molecule, comprising a nucleotide sequence selected 
from the group consisting of: 

(a) Arabidopsis squalene epoxidase from clone ID 
ATA506263 disclosure SEQ ID NO : 4 , clone ID ATA304243 
disclosure SEQ ID NO: 6, clone ID ATA102071 disclosure 

20 SEQ ID NO: 8, clone ATA504158 disclosure SEQ ID NO: 10, 
or the complement thereof ; 

(b) a nucleotide sequence that hybridizes to said 
nucleotide sequence of (a) under a wash stringency 
equivalent to 0 . 5X SSC to 2X SSC, 0.1% SDS, at 55-65°C, 

25 and which encodes a polypeptide having squalene 

epoxidase enzymatic activity substantially similar to 
that of the disclosed squalene epoxidase; 

(c) a nucleotide sequence encoding the same 
genetic information as said nucleotide sequence of (a) , 

30 but which is degenerate in accordance with the 
degeneracy of the genetic code; and 

(d) a nucleotide sequence encoding the same 
genetic information as said nucleotide sequence of (b) , 
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but which is degenerate in accordance with the 
degeneracy of the genetic code. 

An additional aspect of the invention is the 
recombinant constructs and vectors (pMON48343 , Fig. 3 0; 
5 pMON43844, Fig. 31) comprising nucleic acid sequences 
encoding the novel squalene epoxidase, as well as a 
method of producing the novel squalene epoxidase, 
comprising culturing a host cell transformed with the 
novel constructs or vectors for a time and under 
10 conditions conductive to the production of the squalene 
epoxidase, and recovering the squalene epoxidase 
produced thereby. 

ii . Sterol Methyl Transferase I 

15 S-adenosyl -L-methionine : sterol C-24 methyl 

transferases (SMT1 and SMT2) catalyze the transfer of a 
methyl group from a cof actor, S-adenosyl -L-methionine , 
to the C-24 center of the sterol side chain (Bach, T.J. 
and Benveniste, P. (1997), Prog. Lipid Res. 36: 197- 

20 226) . SMT in higher plant cells are responsible for 

their capability to produce a mixture of 24 -methyl and 
24-ethyl sterols (Schaffer, A., Bouvier-Nave, 
Benveniste, P., Schaller, H. (2000) Lipids 35: 263- 
269) . Functional characterization of the SMT using a 

25 yeast erg6 expression system demonstrated unambiguously 
that an SMT1 sequence encodes a cycloartenol-C24- 
methyltransf erase and a SMT2 sequence encodes a 24- 
methylene lophenol-C24 -methyltransf erase in a given 
plant species (Bouvier-Nave, P., Husselstein, T., and 

30 Benveniste, P. (1998), Eur. J. Biochem. 246: 518-529). 
Several plant genes coding for SMT1 and SMT2 have been 
reported and reviewed (Schaffer, A., Bouvier-Nave, 
Benveniste, P., Schaller, H. (2000) Lipids 35: 263- 
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269) . Transgenic plants expressing homologues of either 
SMT1 or SMT2 have been studied (Schaffer, A. , Bouvier- 
Nave, Benveniste, P., Schaller, H. (2000) Lipids 35: 
263-269) . The use of these genes to modify plant sterol 
5 composition are also covered by two Monsanto patent 
applications (WO 98/45457 and WO 00/61771) . 

Sterol methyl transferase I enzymes known in the 
art are useful in the present invention. Examplary 
sequences include the known Arabidopsis sterol methyl 

10 transferase I protein sequence Accession No. U71400 
(disclosure SEQ ID N0:19), the known tobacco sterol 
methyl transferase I protein sequence Accession No. 
U81312 (disclosure SEQ ID NO: 20) and Ricinus communis 
sterol -C-methyltransf erase, Eur. J. Biochem. , 246(2), 

15 518-529 (1997) . (Complete cds, Accession No. g2246457) . 
S-Adenosyl -L -Methionine - Sterol -C24 - 
Methyltransfera.se- -A nucleic acid sequence encoding an 
Arabidopsis thaliana S -adenosyl-L-methionine- sterol - 
C24 -methyltransf erase has been published by Husselstein 

20 et al. (1996) FEBS Letters 381: 87-92. A 24 -sterol C- 
methyltransf erase is enzyme number 2.1.1.41, Enzyme 
Nomenclature 1992 , page 160 . 



25 iii . Sterol C4 -Demethylase 

Sterol C-4 demethylase catalyses the first of 
several demethylation reactions, which results in the 
removal of the two methyl groups at C-4. While in 
animals and fungi the removal of the two C-4 methyl 

30 groups occurs consecutively, in plants it has been 

reported that there are other steps between the first 
and second C-4 demethylations (Bach, T.J. and 
Benveniste, P. (1997), Prog. Lipid Res. 36: 197-226). 



76 MTC 6783.1 

PATENT 

The C-4 demethylation is catalyzed by a complex of 
microsomal enzymes consisting of a monooxygenase , an 
NAD + - dependent sterol 4 -decarboxylase and an NADPH- 
dependent 3 -ketosteroid reductase. 

5 

iv . Obtusifoliol C14a-Demethylase 

Sterol C-14 demethylase catalyzes demethylation at 
C-14 which removes the methyl group at C-14 and creates 
a double bond at that position. In both fungi and 

10 animals, this is the first step in the sterol synthesis 
pathway. However, in higher plants, the 14a-methyl is 
removed after one C-4 methyl has disappeared. Thus, 
while lanosterol is the substrate for C-14 demethylase 
in animal and fungal cells, the plants enzyme uses 

15 obtusifoliol as substrate. Sterol 14 -demethylation is 
mediated by a cytochrome P-450 complex. The mechanism 
of 14oc-methyl removal involves two oxidation steps 
leading to an alcohol, then an aldehyde at C-2 9 and a 
further oxidative step involving a def ormylation 

20 leading to formic acid and the sterol product with a 
typical 8,14-diene (Aoyama, Y . , Yoshida, Y., Sonoda, 
Y., and Sato, Y. (1989) J. Biol. Chem. 264: 18502- 
18505) . Obtusifoliol 14oc-demethylase from Sorghum 
bicolor (L) Moench has been cloned using a gene- 

25 specific probe generated using PCR primers designed 
from an internal 14 amino acid sequence and was 
functionally expressed in E . coli (Bak, S, Kahn, R.A., 
Olsen, C.E. and Halkier, B.A. (1997) The Plant Journal 
11(2) : 191-201) . Also, Sac char omyces cerevisiae CYP51A1 

30 encoding lanosterol-14-demethylase was functionally 

expressed in tobacco (Grausem, B., Chaubet , N. , Gigot, 
C, Loper, J.C., and Benveniste, P. (1995) The Plant 
Journal 7(5): 761-770). 
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Sterol C-14 demethylase enzymes and sequences are 
known in the art. For example Sorghum bicolor 
obtusifoliol 14a-demethylase CYP51 mRNA, described in 
Plant J., 11 (2) : 191-201 (1997) (complete cds Acession 
5 No. U74319) . In order to facilitate the modifications 
to sterol biosynthesis and accumulation described 
herein, the present invention also provides an isolated 
DNA molecule, having a nucleotide sequence selected 
from the group consisting of: 
10 (a) obtusifoliol C14a-demethylase from clone ID: 

ATA101105 disclosure SEQ ID NO: 14, clone ID ATA202967 
disclosure SEQ ID NO: 15, clone ID ATA4 03 931 disclosure 
SEQ ID NO: 17, or the complement thereof; 

(b) a nucleotide sequence that hybridizes to said 
15 nucleotide sequence of (a) under a wash stringency 

equivalent to 0 . 5X SSC to 2X SSC, 0.1% SDS, at 55-65°C, 
and which encodes a polypeptide having obtusifoliol 
C14a-demethylase enzymatic activity substantially 
similar to that of the disclosed obtusifoliol C14a- 
20 deme t try 1 a s e ; 

(c) a nucleotide sequence encoding the same 
genetic information as said nucleotide sequence of (a) , 
but which is degenerate in accordance with the 
degeneracy of the genetic code; and 

25 (d) a nucleotide sequence encoding the same 

genetic information as said nucleotide sequence of (b) , 
but which is degenerate in accordance with the 
degeneracy of the genetic code. 

An additional aspect of the invention is the 

30 recombinant constructs and vectors (pMON43 842, Fig. 29) 
comprising nucleic acid sequences encoding the novel 
obtusifoliol C14a-demethylase , as well as a method of 
producing the novel obtusifoliol C14a-demethylase , 
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comprising culturing a host cell transformed with the 
novel constructs or vectors for a time and under 
conditions conductive to the production of the 
obtusifoliol C14oc-demethylase , and recovering the 
obtusifoliol C14cc-demethylase produced thereby. 

v. Sterol C5-Desaturase 

Sterol C-5 desaturase catalyzes the insertion of 
the A 5 -double bond that normally occurs at the A 7 -sterol 
level, thereby forming a A 5,7 - sterol (Parks et al . , 
Lipids 30:227-230 (1995)). The reaction has been 
reported to involve the stereospecif ic removal of the 
5a and 6a hydrogen atoms, biosynthetically derived from 
the 4 pro-R and 5 pro-S hydrogens of the (+) and (-) R- 
mevalonic acid, respectively (Goodwin, T.W. (1979) 
Annu. Rev. Plant Physiol. 30: 369-404). The reaction 
is obligatorily aerobic and requires NADPH or NADH. 
The desaturase has been reported to be a multienzyme 
complex present in microsomes. It consists of the 
desaturase itself, cytochrome b 5 and a pyridine 
nucleotide-dependent f lavoprotein . The A 5 -desaturase is 
reported to be a mono -oxygenase that utilizes electrons 
derived from a reduced pyridine nucleotide via 
cytochrome,, (Taton, M. , and Rahier, A. (1996) Arch. 
Biochem. Biophys . 325: 279-288). An Arabidopsis 
thaliana cDNA encoding a sterol -C5 -desaturase was 
cloned by functional complementation of a yeast mutant, 
erg3 defective in ERG3 , the gene encoding the sterol 
C5 -desaturase required for ergosterol biosynthesis 
(Gachotte D., Husselstein, T., Bard, M., Lacroute F . , 
and Benveniste, P. (1996) The Plant Journal 9(3): 391- 
398) . Known sterol C5 -desaturase enzymes are useful in 
the present invention, including Arabidopsis sterol C5- 
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desaturase protein sequence Accession No. X90454, 
disclosure SEQ ID NO: 22, and the Arabidopsis thaliana 
mRNA for sterol-C5-desaturase described in Plant J. 
9(3) :391-398 (1996) (complete cds Accession No. 
5 gl061037) . 

The NCBI (National Center for Biotechnology 
Information) database shows 3 7 sequences for sterol 
desaturase that are useful in the present invention. 
The following are exemplary of such sequences. From 

10 yeast: C5 sterol desaturase NP_013157 (Saccharomyces 

cerevisiae) ; hypothetical C5 sterol desaturase-f ission 
T40027 (Schizosaccharomyces pombe) ; C5 sterol 
desaturase- fission T37759 (Schizosaccharomyces pombe) ; 
C5 sterol desaturase JQ1146 (Saccharomyces cerevisiae) ; 

15 C5 sterol desaturase BAA21457 (schizosaccharomyces 
pombe) ; C5 sterol desaturase CAA22610 
(Schizosaccharomyces pombe) ; putative C5 sterol 
desaturase CAA16898 (Schizosaccharomyces pombe) ; 
probable C5 sterol desaturase 013666 (erg3 _schpo) ; C5 

20 sterol desaturase P50860 (Erg3_canga) ; C5 sterol 
desaturase P32353 (erg3_yeast) ; C5 , 6 desaturase 
AAC99343 (Candida albicans) ; C5 sterol desaturase 
BAA2 02 92 (Saccharomyces cerevisiae) ; C5 sterol 
desaturase AAB3 9844 (Saccharomyces cerevisiae) ; C5 

25 sterol desaturase AAB2 9844 (Saccharomyces cerevisiae) ; 
C5 sterol desaturase CAA643 03 (Saccharomyces 
cerevisiae) ; C5 sterol desaturase AAA34595 
(Saccharomyces cerevisiae) ; C5 sterol desaturase 
7AAA34594 (Saccharomyces cerevisiae). From plants: C5 

30 sterol desaturase S71251 (Arabidopsis thaliana) ; 

putative sterol -C5 -desaturase AAF32466 (Arabidopsis 
thaliana) ; sterol-C5-desaturase AAF32465 (Arabidopsis 
thaliana) ; putatuve sterol desaturase AAF22921 
(Arabidopsis thaliana) ; delta7 sterol C5 desaturase 
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(Arabidopsis thaliana) ; sterol C5(6) desaturase homolog 
AAD20458 (Nicotiana tabacum) ; sterol C5 desaturase 
AAD12944 (Arabidopsis thaliana); sterol C5,6 desaturase 
AAD04034 (Nicotiana tabacum) ; sterol C5 desaturase 
CAA62079 (Arabidopsis thaliana). From mammals: sterol - 
C5-desaturase (Mus musculus) BAA33730; sterol-C5- 
desaturase BAA33729 (Homo sapiens) ; lathosterol oxidase 
CAB65928 (Leishmania major) ; lathosterol oxidase 

(lathosterol 5 -desaturase) 088822 (Mus musculus) ; 
lathosterol 5 -desaturase 075845 (Homo sapiens) ; delta7 
sterol C5 desaturase AAF00544 (Homo sapiens). Others: 
fungal sterol C5 desaturase homolog BAA18 970 (Homo 
sapiens) . 

For DNA sequences encoding a sterol-C5-desaturase 
useful in the present invention, the NCBI__nucleotide 
search for "sterol desaturase" came up with 110 
sequences. The following are exemplary of such 
sequences. NC_001139 (Saccharomyces cerevisiae); 
NC_001145 (Saccharomyces cerevisiae) ; NC_001144 
(Saccharomyces cerevisiae) ; AW700015 (Physcomitrella 
patens) ; AB004539 (Schizosaccharomyces pombe) ; and 
AW596303 (Glycine max) ; AC012188 (Arabidopsis 
thaliana) . 

vi . Sterol Methyl Transferase II 
The combination of introduction of an HMG-CoA 
reductase gene along with a sterol methyl transferase 
II gene into a cell serves to reduce steroid pathway 
intermediate compound accumulation in addition to 
reducing the accumulation of 24 -methyl sterols such as 
campesterol . 

Known sterol methyl transferase II enzymes are 
useful in the present invention, including Arabidopsis 
sterol methyl transferase II protein sequence (complete 
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mRNA cds from FEBS Lett. 381(12) : 87-92 (1996) Accession 
No. X89867) , disclosure SEQ ID NO:21. 

Recombinant constructs encoding any of the 
forgoing enzymes affecting the steroid biosynthetic 
pathway can be incorporated into recombinant vectors 
comprising the recombinant constructs comprising the 
isolated DNA molecules. Such vectors can be bacterial 
or plant expression vectors. 

IV. Recombinant Constructs and Vectors 
The present invention contemplates a recombinant 
construct that contains a DNA sequence encoding a 
polypeptide exhibiting 3 -hydroxy- 3 -methyl fluaryl - 
Coenzyme A (HMG-CoA) reductase activity and a DNA 
sequence encoding a polypeptide exhibiting the activity 
of a steroid pathway enzyme. Each polypeptide-encoding 
DNA sequence is operably linked in the 5 ' to 3 ' 
direction independent of the other sequence. Each DNA 
sequence in the 5' to 3' direction comprises a 
promoter, then the DNA sequence encoding the 
polypeptide then a transcription termination signal 
sequence. The steroid pathway enzyme is a squalene 
epoxidase enzyme, a sterol methyl transferase I enzyme, 
a sterol C4 -demethylase enzyme, a obtusifoliol C14a- 
demethylase enzyme, a sterol C5-desaturase enzyme, or a 
sterol methyl transferase II enzyme. 

Preferably, the promoters in the recombinant 
construct are seed-specific promoters. In one 
embodiment, the promoter is derived from a species in a 
different order from the host cell. In other 
embodiments, the encoded HMG CoA reductase and/or 
steroid pathway enzymes is (are) from a species in a 
different order from the order that of the host cell. 
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It is contemplated that a construct comprises more 
than one of the DNA sequences encoding a steroid 
pathway enzyme . 

The invention also contemplates a recombinant 
5 vector comprising the above -described recombinant 

construct, wherein that vector is preferably a plant 
expression vector. 

A recombinant DNA molecule of the present 
invention can be produced by operatively linking a 

10 vector to a useful DNA segment discussed herein to form 
a plasmid. A vector capable of directing the 
expression of a polypeptide having HMG-CoA reductase 
activity is referred to herein as an HMG-CoA reductase 
"plant integrating vector" . 

15 Such plant integrating vectors contain control 

elements that direct and regulate expression, including 
a promoter, a marker, a terminator and insertion 
sequence (e.g. FIG. 5). The polypeptide coding genes 
are operatively linked to the plant integrating vector 

20 to allow the promoter sequence to direct RNA polymerase 
binding and expression of the desired polypeptide 
coding gene . 

Useful in expressing the polypeptide coding gene 
are promoters that are inducible, viral, synthetic, 

25 constitutive as described by Poszkowski et al . , EMBO 
J • , 3:2719 (1989) and Odell et al . , Nature, 313:810 
(1985) , and temporally regulated, spatially regulated 
and spatiotemporally regulated as given in Chau et al . , 
Science, 244:174-181 (1989). The promoter preferably 

30 comprises a promoter sequence whose function in 
regulating expression of the structural gene is 
substantially unaffected by the amount of sterol or 
squalene in the cell . As used herein, the term 
"substantially unaffected" means that the, promoter is 
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not responsive to direct feedback control by the 
sterols or squalene accumulated in transformed cells or 
transgenic plants. 

A promoter is also selected for its ability to 
direct the transformed plant cell's or transgenic 
plant's transcriptional activity to the structural gene 
encoding a polypeptide having HMG-CoA reductase 
activity. Structural genes can be driven by a variety 
of promoters in plant tissues. Promoters can be near- 
constitutive, such as the CaMV 35S promoter, or tissue 
specific or developmentally specific promoters 
affecting dicots or monocots. 

As exemplified and discussed in detail herein, 
where the near-constitutive promoter CaMV 35S is used 
to transform tobacco plants, increases in total sterol 
and squalene accumulation are found in a variety of 
transformed plant tissues (e.g. callus, leaf, seed and 
root) . Alternatively, the effects of transformation 
(e.g. increased amount of a gene coding for HMG-CoA 
reductase, increased total sterol accumulation and 
increased squalene accumulation) can be directed to 
specific plant tissues by using plant integrating 
vectors containing a tissue- specif ic promoter. 

An exemplary tissue-specific promoter is the 
Lectin promoter, which is specific for seed tissue. 
The Lectin protein in soybean seeds is encoded by a 
single gene (Lei) that is only expressed during seed 
maturation and accounts for about 2 to about 5 percent 
of total seed mRNA. The Lectin gene and seed-specific 
promoter have been fully characterized and used to 
direct seed specific expression in transgenic tobacco 
plants. See, e.g., Vodkin et al . , Cell, 34:1023 (1983) 
and Lindstrom et al . , Developmental Genetics, 11:160 
(1990) . 
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A plant integrating vector containing a structural 
gene coding for a polypeptide having HMG-CoA reductase 
activity is engineered to be under control of the 
Lectin promoter and that vector is introduced into 
5 soybean plants using a protoplast transformation 

method. E.G. Dhir et al . , Plant Cell Reports, 10:97 
(1991) . The expression of the polypeptide having HMG- 
CoA reductase activity is directed specifically to the 
seeds of the transgenic plant. In this way, a 

10 transgenic soybean seed having increased squalene 

accumulation is produced. Such seeds can then be used 
to extract oil containing enhanced levels of squalene. 
As set forth hereinafter, such squalene -enhanced oil is 
characterized by a greater thermal stability when 

15 compared to non- squalene -enhanced oil. 

In the present invention, a plant has an 
exogenously provided structural gene for HMG-CoA 
reductase and at least one of the six enumerated 
steroid pathway enzymes, a squalene epoxidase enzyme, a 

20 sterol methyl transferase I enzyme, a sterol C4- 

demethylase enzyme, a obtusifoliol C14a-demethylase 
enzyme, a sterol C5 -desaturase enzyme, or a sterol 
methyl transferase II enzyme. The plant or seed thus 
containing these added genes is contemplated, while the 

25 methods to arrive at a plant or seed according to the 
invention are open to the multitude of methods 
contemplated by a person of ordinary skill in the art. 
In particular, all of the added structural genes do not 
have to have been added at the same time, or by the 

30 same route. Thus, for example, the HMG-CoA reductase 
activity may result from a cross with a plant made 
according to a process of U.S. Patent No. 5,349,126, 
while a steroid pathway enzyme is added by nucleic acid 
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bombardment to that plant. Further, when more than one 
nucleotide sequence encoding a steroid pathway enzyme 
is present in a contemplated plant, the expression of 
the gene does not have to be under the control of the 
5 same promoter, or even the same type of promoter. 

The choice of which plant integrating vector and 
ultimately to which promoter a polypeptide coding gene 
is operatively linked depends directly on the 
functional properties desired, e.g. the location and 

10 timing of protein expression, and the host cell to be 
transformed. These are well known limitations inherent 
in the art of constructing recombinant DNA molecules. 
However, a vector useful in practicing the present 
invention is capable of directing the expression of the 

15 polypeptide coding gene, i.e., the gene encoding HMG- 
CoA reductase activity, included in the DNA segment to 
which it is operatively linked. 

Typical vectors useful for expression of genes in 
higher plants are well known in the art and include 

20 vectors derived from the tumor- inducing (Ti) plasmid of 
Agrobacterium tumefaciens , described by Rogers et al . , 
Meth. Enzymol., 153:253-277 (1987). However, several 
other plant integrating vector systems are known to 
function in plants including pCaMVCN transfer control 

25 vector described by Fromm et al . Proc . Nat. Acad. Sci . 
USA, 82:5824 (1985). Plasmid pCaMVCN (available from 
Pharmacia, Piscataway, N.J.) includes the cauliflower 
mosaic virus CaMV 35S promoter. 

The use of retroviral plant integrating vectors to 

30 form the recombinant DNAs of the present invention is 
also contemplated. As used herein, the term 
"retroviral plant integrating vector" refers to a DNA 
molecule that includes a promoter sequence derived from 
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the long terminal repeat (LTR) region of a retrovirus 
genome . 

In preferred embodiments, the vector used to 
express the polypeptide coding gene includes a 
selection marker that is effective in a plant cell, 
preferably a drug resistance selection marker. One 
preferred drug resistance marker is the gene whose 
expression results in kanamycin resistance; i.e., the 
chimeric gene containing the nopaline synthase 
promoter, Tn5 neomycin phosphotransferase II and 
nopaline synthase 3' nontranslated region described by 
Rogers et al . , in Methods for Plant Molecular Biology, 
A. Weissbach and H. Weissbach eds . , Academic Press 
Inc., San Diego, Calif. (1988). 

A variety of methods have been devoloped to 
operatively link DNA to vectors via complementary 
cohesive termini or blunt ends. For instance, 
complementary homopolymer tracts can be added to the 
DNA segment to be inserted and to the vector DNA. The 
vector and DNA segment are then joined by hydrogen 
bonding between the complementary homopolymeric tails 
to form recombinant DNA molecules. 

Alternatively, synthetic linkers containing one or 
more restriction endonuclease sites can be used to join 
the DNA segment to the plant integrating vector. The 
synthetic linkers are attached to blunt -ended DNA 
segments by incubating the blunt -ended DNA segments 
with a large excess of synthetic linker molecules in 
the presence of an enzyme that is able to catalyze the 
ligation of blunt-ended DNA molecules, such as 
bacteriophage T4 DNA ligase. Thus, the products of the 
reaction are DNA segments carrying synthetic linker 
sequences at their ends. These DNA segments are then 
cleaved with the appropriate restriction endonuclease 
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and ligated into a plant integrating vector that has 
been cleaved with an enzyme that produces termini 
compatible with those of the synthetic linker. 
Synthetic linkers containing a variety of restriction 
5 endonuclease sites are commercially available from a 
number of sources including New England BioLabs, 
Beverly, Mass. 

Also contemplated by the present invention are RNA 
equivalents of the above-described recombinant DNA 
10 molecules. 

A preferred recombinant DNA molecules utilized in 
accordance with the present invention are pMON53733- 
pMON53740 (Figures 13-20) . 

15 A . Promoters and Target Tissues 

Promoters useful in the present invention include 
those that confer appropriate cellular and temporal 
specificity of expression. Such promoters include 
those that are constitutive or inducible, 

20 environmentally- or developmentally-regulated, or 
organelle-, cell-, or tissue-specific. Preferred 
promoters for use with the present invention promote 
expression of the introduced enzymes in the seed in the 
cytosol, although expression in plasids or organelles 

25 of the seeds is also contemplated. 

Often-used constitutive promoters include the CaMV 
35S promoter (Odell et al . (1985) Nature 313: 810), the 
enhanced CaMV 3 5S promoter, the Figwort Mosaic Virus 
(FMV) promoter (Richins et al . (1987) NAR 20: 8451), 

30 the mannopine synthase (mas) promoter, the nopal ine 
synthase (nos) promoter, and the octopine synthase 
iocs) promoter. 

Useful inducible promoters include heat -shock 
promoters (Ou-Lee et al . (1986) Proc . Natl. Acad. Sci. 
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USA 83: 6815; Ainley et al . (1990) Plant Mol . Biol. 14: 
949) , a nitrate-inducible promoter derived from the 
spinach nitrite reductase gene (Back et al . (1991) 
Plant Mol. Biol. 17: 9), hormone -inducible promoters 
(Yamaguchi-Shinozaki et al . (1990) Plant Mol. Biol. 15: 
905; Kares et al . (1990) Plant Mol. Biol. 15: 905), and 
light-inducible promoters associated with the small 
subunit of RuBP carboxylase and LHCP gene families 
(Kuhlemeier et al . (1989) Plant Cell 1: 471; Feinbaum 
et al. (1991) Mol. Gen. Genet. 226: 449; Weisshaar et 
al. (1991) EMBO J. 10: 1777; Lam and Chua (1990) 
Science 248: 471; Castresana et al . (1988) EMBO J. 7: 
1929; Schulze-Lef ert et al . (1989) EMBO J. 8: 651). 

Examples of useful tissue-specific, 
developmentally-regulated promoters include fruit- 
specific promoters such as the E4 promoter (Cordes et 
al. (1989) Plant Cell 1:1025), the E8 promoter (Deikman 
et al. (1988) EMBO J. 7: 3315), the kiwifruit actinidin 
promoter (Lin et al . (1993) PNAS 90: 5939), the 2A11 
promoter (Houck et al . , U.S. Patent 4,943,674), and the 
tomato pZ130 promoter (U.S. Patents 5,175, 095 and 
5,530,185); the P-conglycinin 7S promoter (Doyle et al . 

(1986) J. Biol. Chem. 261: 9228; Slighton and Beachy 

(1987) Planta 172: 356), and seed- specif ic promoters 
(Knutzon et al . (1992) Proc . Natl. Acad. Sci. USA 89: 
2624; Bustos et al . (1991) EMBO J. 10: 1469; Lam and 
Chua (1991) J. Biol. Chem. 266: 17131; Stayton et al . 
(1991) Aust. J. Plant. Physiol. 18: 507). Fruit- 
specific gene regulation is discussed in U.S. Patent 
5,753,475. Other useful seed-specific promoters 
include, but are not limited to, the napin, phaseolin, 
zein, soybean trypsin inhibitor, 7S, ADR12, ACP, 
stearoyl-ACP desaturase, oleosin, Lasquerella 
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hydroxylase, and barley aldose reductase promoters 
(Bartels (1995) Plant J. 7: 809-822), the EA9 promoter 
(U.S. Patent 5,420,034), and the Bce4 promoter (U.S. 

Patent 5,530,194). Useful embryo- specif ic promoters 
5 include the corn globulin 1 and oleosin promoters. 

Useful endosperm-specific promoters include the rice 

glutelin-1 promoter, the promoters for the low-pi 

amylase gene (Amy32b) (Rogers et al . (1984) J". Biol. 

Chem. 259: 12234), the high-pi -amylase gene (Amy 64) 
10 (Khurseed et al . (1988) J". Biol. Chem. 263: 18953), and 

the promoter for a barley thiol protease gene 
( "Aleurain" ) (Whittier et al . (1987) Nucleic Acids Res. 

15 : 2515) . 

Appropriate target tissues of plants for enhanced 
15 production of sterol compounds such as sitosterol, 

sitosterol esters, sitostanol, sitostanol esters, and 
tocopherols, and reduced production of campesterol, 
campestanol, and esters thereof, include, but are not 
limited to, fruits, flowers, seeds, roots, tubers, 
20 leaves, stems, buds, and other vegetable parts of 

plants. Within seeds, appropriate organ compartments 
include the embryo, the endosperm, and the aleurone 
layer. Within any of the noted target tissues, 
appropriate cellular compartments include, but are not 
25 limited to, the cell cytoplasm and plastids (e.g., 

proplastids, chloroplast s , chromoplast s , leucoplasts, 
amyloplasts, etc.). 

B . Vectors 

30 In plants, transformation vectors capable of 

introducing encoding DNAs involved in sterol compound 
and tocopherol biosynthesis are easily designed, and 
generally contain one or more DNA coding sequences of 
interest under the transcriptional control of 5 ' and 3 ' 
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regulatory sequences. Such vectors generally comprise, 
operatively linked in sequence in the 5 ' to 3 ' 
direction, a promoter sequence that directs the 
transcription of a downstream heterologous structural 
5 DNA in a plant; optionally, a 5' non-translated leader 
sequence; a nucleotide sequence that encodes a protein 
of interest; and a 3' non-translated region that 
encodes a polyadenylation signal which functions in 
plant cells to cause the termination of transcription 

10 and the addition of polyadenylate nucleotides to the 3 ' 
end of the mRNA encoding the protein. Plant 
transformation vectors also generally contain a 
selectable marker. Typical 5 '-3' regulatory sequences 
include a transcription initiation start site, a 

15 ribosome binding site, an RNA processing signal, a 
transcription termination site, and/or a 
polyadenylation signal . Vectors for plant 
transformation have been reviewed in Rodriguez et al . 
(1988) Vectors: A Survey of Molecular Cloning Vectors 

20 and Their Uses, Butterworths , Boston; Glick et al . 
(1993) Methods in Plant Molecular Biology and 
Biotechnology CRC Press, Boca Raton, Fla; and Croy 
(1993) In Plant Molecular Biology Labfax, Hames and 
Rickwood, Eds., BIOS Scientific Publishers Limited, 

25 Oxford, UK. 

The use of transit peptides, e.g. translat ional 
fusion peptides, are not preferred for use in 
conjunction with the enzymes of the present invention, 
where the sterol synthethic compounds are present 

30 primarily in the cellular cytosol . 



V. Cell Transformation and Plant Regeneration 
The amount of a gene coding for a polypeptide 
having HMG-CoA reductase activity is increased by 
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transforming a desired plant cell with a suitable 
vector that contains that added exogenous structural 
gene. Expression of that gene in the transformed plant 
cell and transgenic plants developed from that 
5 transformed plant cell enhances the activity of HMG-CoA 
reductase . 

Methods for transforming polypeptide-coding genes 
into plant cells include Agrrobacterium-mediated plant 
transformation, protoplast transformation, gene 

10 transfer into pollen, injection into reproductive 

organs and injection into immature embryos. Each of 
these methods has distinct advantages and 
disadvantages. Thus, one particular methods of 
introducing genes into a particular plant strain may 

15 not necessarily be the most effective for another plant 
strain, but it is well known which methods are useful 
for a particular plant strain. 

Agrobacterium-mediated transfer is a widely 
applicable system for introducing genes into plant 

20 cells because the DNA can be introduced into whole 
plant tissues, thereby bypassing the need for 
regeneration of an intact plant from a protoplast . The 
use of Agrrobacterium-mediated plant integrating vectors 
to introduce DNA into plant cells is well known in the 

25 art. See, for example, the methods described by Fraley 
et al . , Biotechnology, 3:629 (1984) and Rogers et al . , 
Methods in Enzymology, 153:253-277 (1987). Further the 
integration of the T8-DNA is a relatively precise 
process resulting in few rearrangements . The region of 

30 DNA to be transferred is defined by the border 

sequences, and intervening DNA is usually inserted into 
the plant genome as described by Spielmann et al . , Mol . 
Gen. Genet., 2 05:34 (1986) and Jorgensen et al , Mol. 
Gen. Genet., 207:471 (1987). 
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Modern Agrobacterium transformation vectors are 
capable of replication in E . coli as well as 
Agrobacterium, allowing for convenient manipulations as 
described by Klee et al . , in Plant DNA Infectious 
Agents, T. Hohn and J. Schell, eds . , Springer- Verlag, 
New York (1985) pp. 179-203. 

Moreover, recent technological advances in vectors 
for Agrobacterium- mediated gene transfer have improved 
the arrangement of genes and restriction sites in the 
vectors to facilitate construction of vectors capable 
of expressing various polypeptide coding genes. The 
vectors described by Rogers et al . , Meth. Enzymol . , 
153:253 (1987), have convenient multi-linker regions 
flanked by a promoter and a polyadenylation site for 
directed expression inserted polypeptide conding genes 
and are suitable for present purposes. 

In addition, Agrobacteria containing both armed 
and disarmed Ti genes can be used for the 
transformations. Both types of transforming systems 
are illustrated herein. Transf ormants from the former 
system result in callus from which the desired squalene 
or sterol can be obtained, whereas transf ormants 
obtained from the latter, disarmed Ti genes can be 
regenerated into complete transgenic plants from whose 
tissues, e.g. leaf, seed and root, the desired 
chemicals can be obtained. 

In those plant strains where Agrobacterium- 
mediated transformation is efficient, it is the method 
of choice because of the facile and defined nature of 
the gene transfer. 

Agrobacteri um-mediated transformation of leaf 
disks and other tissues, such as cotyledons and 
hypocotyls, appears to be limited to plant strains that 
Agrobacaterium naturally infects. Agrobacterium- 
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mediated transformation is the most efficient in 
dicotyledonous plants. Few monocots appear to be 
natural hosts for Agrobacterium, although transgenic 
plants have been produced in the monocot , asparagus, 
using Agrobacterium vectors as described by Bytebier et 
al . , Proc. Natl. Acad. Sci . USA, 84:5345 (1987). 
Therefore, commercially important cereal grains such as 
rice, corn, and wheat must usually be transformed using 
alternative methods. However, as mentioned above, the 
transformation of asparagus using Agrobacteriuim can 
also be achieved. 

A transgenic plant formed using Agrobacterium 
transformation methods typically contains a single gene 
on one chromosome. Such transgenic plants can be 
referred to as being heterozygous for the added gene. 
However, inasmuch as use of the word "heterozygous" 
usually implies the presence of a complementary gene at 
the same locus of the second chromosome of a pair of 
chromosomes, and there is no such gene in a plant 
containing one more than one added gene as here, it is 
believed that a more accurate name for such a plant is 
an independent segregant , because the added, exogenous 
gene segregates independently during mitosis and 
meiosis. A transgenic plant containing a single 
structural gene that encodes a polypeptide having HMG- 
CoA reductase activity and at least one of the 
enumerated 6 steroid pathway enzymes; i.e., and 
independent segregant, is a preferred transgenic plant. 

More preferred is a transgenic plant that is 
homozygous for the added structural gene ; i.e., a 
transgenic plant that contains two added genes, one 
gene at the same locus on each chromosome of a 
chromosome pair. A homozygous transgenic plant can be 
obtained by sexually mating (selfing) an independent 
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segregant transgenic plant that contains the added 
genes according to the invention, germinating some of 
the seed produced and analyzing the resulting plants 
produced for enhanced HMG-CoA reductase activity, 
5 steroid pathway product accumulation or both, relative 
to a control (native, non- transgenic) or an independent 
segregant transgenic plant . 

A homozygous transgenic plant exhibits enhanced 
HMG-CoA reductase activity as compared to a native, 

10 non -transgenic plant and an independent segregant 
transgenic plant . 

It is to be understood that two different 
transgenic plants can also be mated to produce 
offspring that contain two independently segregating 

15 added, exogenous genes. Selfing of appropriate progeny 
can produce plants that are homozygous for both added, 
exogenous genes that encode a polypeptide having HMG- 
CoA reductase activity. Back-crossing to a parental 
plant and outcrossing with a non- transgenic plant are 

20 also contemplated. 



A. Host Cells and Transformed Plant Cells 
Cells modified according to the present invention 
are contemplated at each stage of the processes of the 
25 invention. As a result of the invention comprising at 
least two genes, there are several means to accomplish 
that end. In some embodiments of the invention, the 
intermediate steps include transformation of nucleic 
acids comprising some or all of the genes into host 
30 cells. 

The nucleic acid sequence encoding a polypeptide 
exhibiting HMGR activity does not have to be in the 
same orientation as a nucleic acid sequence encoding a 
polypeptide exhibiting the activity of a steroid 
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pathway enzyme. The coding nucleic acids can be under 
the control of different promoters or be in different 
orientations. For the host plant cell useful in 
carrying out the steroid compound biosynthesis 
5 according to the invention, the minimum that is 

required is the coding nucleic acids be in the same 
host cell . As long as the HMGR and a steroid pathway 
enzyme coding sequences are present in the same host 
cell, they do not have to be on the same DNA molecule 

10 or under the control of the same promoter, nor do they 
have to be derived from the same vector or construct . 

Host cells are useful for making, storing, 
reproducing or manipulating nucleic acid constructs of 
the invention. Contemplated host cells are eukaryotic 

15 cells, such as yeast or plant cells. Any plant cells 
can be utilized with the present invention. Some 
particularly useful agriculturally significant plant 
cells are canola, soybean, corn, maize, tobacco, 
cotton, rape, tomato and alfalfa. Other common plant 

20 varieties are carrot, barley, arabidopsis, guayule and 
petunia. Prokaryotic host cells containing constructs 
and/or vectors according to the invention are also 
contemplated (e.g. E. coli) . 

One embodiment of the invention is a transformed 

25 host cell containing inter alia a recombinant construct 
that encodes both a DNA sequence encoding a polypeptide 
exhibiting 3 -hydroxy- 3 -methylglutaryl -Coenzyme A 
reductase enzyme activity and a DNA sequence encoding a 
steroid pathway enzyme, where the steroid pathway 

30 enzyme is as described in detail above. In a preferred 
embodiment , those coding DNA sequences are operably 
linked to a promoter and a transcription termination 
signal sequence. In the coding sense direction of the 
construct, the components of the construct are operably 
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linked in the 5 ' to 3 ' direction as a promoter, the DNA 
sequence encoding sequence and a transcription 
termination signal sequence. 

Another embodiment of the invention is host cell 
that has been transformed with a recombinant vector 
that has such a construct. As discussed herein, in one 
embodiment of the invention, such a recombinant vector 
is a plant expression vector. Preferably such a host 
cell is a plant cell. 

Methods of culturing various eukaryotic and 
prokaryotic cell cultures are well known in the art. 
The present invention contemplates cell cultures of 
transformed host cells. Transformed plant cells 
include transformed protoplasts and other types of host 
cell intermediates as well as plant cell cultures. 

Non-limiting examples of plants that can be used 
in the practice of the invention include, acacia, 
alfalfa, aneth, apple, apricot, artichoke, arugula, 
asparagus, avocado, banana, barley, beans, beet, 
blackberry, blueberry, broccoli, brussel sprouts, 
cabbage, canola, cantaloupe, carrot, cassava, 
cauliflower, celery, cherry, chicory, cilantro, citrus, 
Clementines, coffee, corn, cotton, cucumber, Douglas 
fir, eggplant, endive, escarole, eucalyptus, fennel, 
figs, garlic, gourd, grape, grapefruit, honey dew, 
jicama, kiwifruit, lettuce, leeks, lemon, lime, 
Loblolly pine, mango, melon, mushroom, nectarine, nut, 
oat, oil palm, oil seed rape, okra, onion, orange, 
ornamental plants, papaya, parsley, pea, peach, peanut, 
pear, pepper, persimmon, pine, pineapple, plantain, 
plum, pomegranate, poplar, potato, pumpkin, quince, 
radiata pine, radicchio, radish, raspberry, rice, rye, 
sorghum, Southern pine, soybean, spinach, squash, 
strawberry, sugarbeet , sugarcane, sunflower, sweet 
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potato, sweetgum, tangerine, tea, tobacco, tomato, 
triticale, turf, turnip, watermelon, wheat, yam, and 
zucchini . 

Plants particularly attractive for the steroid 
5 pathway modifications described herein include those 
that produce carbon substrates which can be employed 
for synthesis of these compounds. Non-limiting 
examples of such plants include various monocots and 
dicots, including high oil seed plants such as high oil 

10 seed Brassica (e.g., Brassica nigra, Brassica napus , 
Brassica hirta, Brassica rapa, Brassica campestris , 
Brassica carinata, and Brassica juncea) , soybean 
(Glycine max) , castor bean (Ricinus communis) , cotton, 
saf flower {Carthamus tinctorius) , sunflower (Helianthus 

15 annuus) , flax [Linum usitatissimum) , corn (Zea mays), 
coconut (Cocos nucifera) , palm (Elaeis guineensis) , 
oilnut trees such as olive (Olea europaea) , sesame, and 
peanut (Arachis hypogaea) , as well as Arabidopsis , 
tobacco, wheat, barley, oats, amaranth, potato, rice, 

20 tomato, and legumes (e.g., peas, beans, lentils, 
alfalfa, etc.) . 

Enhancement of sitostanol compound production by 
the methods discussed herein is expected to result in 
yields of sitostanol, sitostanol esters, or mixtures 

25 thereof in an amount of at least about 57% by weight, 
preferably from about 5 7% to about 90% by weight, and 
more preferably from about 57% to about 65% by weight 
of the total sterol compounds present in seed oil. 
Expressed on a seed dry weight basis, sitostanol, 

30 sitostanol esters, or mixtures thereof are expected to 
be present in an amount of at least about 0.08%, 
preferably from about 0.08% to about 0.8%, and more 
preferably from about 0.08% to about 0.4% of seed dry 
weight . 
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B . Processes of Transformation 

A variety of different methods can be employed to 
introduce transformation/expression vectors into plant 
protoplasts, cells, callus tissue, leaf discs, 
5 meristems, etc., to generate transgenic plants. These 
methods include, for example, Agnrobacteri urn-mediated 
transformation, particle gun delivery, microinjection, 
electroporation, polyethylene glycol -mediated 
protoplast transformation, liposome-mediated 

10 transformation, etc. (reviewed in Potrykus (1991) Ann. 
Rev. Plant Physiol. Plant Mol . Biol. 42: 205). 

In general, transgenic plants comprising cells 
containing and expressing nucleic acids encoding 
enzymes facilitating the modifications in sterol 

15 compound and tocopherol biosynthesis and accumulation 
described herein can be produced by transforming plant 
cells with a DNA construct as described above via any 
of the foregoing methods; selecting plant cells that 
have been transformed on a selective medium; 

20 regenerating plant cells that have been transformed to 
produce differentiated plants; and selecting a 
transformed plant that expresses the enzyme -encoding 
nucleotide sequence (s) at a level such that the amount 
of sitosterol, sitosterol esters, sitostanol, 

25 sitostanol esters, tocopherol compound (s) , and 

campesterol/campestanol and their esters is within the 
ranges described herein. 

The encoding DNAs can be introduced either in a 
single transformation event (all necessary DNAs present 

30 on the same vector) , a co- transformation event (all 
necessary DNAs present on separate vectors that are 
introduced into plants or plant cells simultaneously) , 
or by independent transformation events (all necessary 
DNAs present on separate vectors that are introduced 
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into plants or plant cells independently) . Traditional 
breeding methods can subsequently be used to 
incorporate the desired combination of enzymes into a 
single plant, and to produce hybrid progeny of the 

5 invention plants. 

Specific methods for transforming a wide variety 
of dicots and obtaining transgenic plants are well 
documented in the literature (Gasser and Fraley (1989) 
Science 244: 1293; Fisk and Dandekar (1993) Scientia 

10 Horticulture 55: 5; Christou (1994) Agro Food Industry 
Hi Tech, p. 17; and the references cited therein) . 

Apomixis is a genetically controlled method of 
reproduction in plants where the embryo is formed 
without union of an egg and a sperm. There are three 

15 basic types of apomictic reproduction: 1) apospory 
where the embryo develops from a chromosomal ly 
unreduced egg in an embryo sac derived from the 
nucellus, 2) diplospory where the embryo develops from 
an unreduced egg in an embryo sac derived from the 

20 megaspore mother cell, and 3) adventitious embryony 

where the embryo develops directly from a somatic cell. 
In most forms of apomixis, psuedogamy or fertilization 
of the polar nuclei to produce endosperm is necessary 
for seed viability. In apospory, a "nurse" cultivar 

25 can be used as a pollen source for endosperm formation 
in seeds. The nurse cultivar does not affect the 
genetics of the aposporous apomictic cultivar since the 
unreduced egg of the cultivar develops 
parthenogenetically , but makes possible endosperm 

30 production. 

Apomixis is economically important, especially in 
transgenic plants, because it causes any genotype, no 
matter how heterozygous, to breed true. Thus, with 
apomictic reproduction, heterozygous transgenic plants 
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can maintain their genetic fidelity throughout repeated 
life cycles. Methods for the production of apomictic 
plants are known in the art. See, U.S. Patent No. 
5,811,636 and references cited therein which are herein 
5 incorporated by reference. 

The present invention also encompasses uniform 
populations of any of the plants discussed herein. 

Successful transformation and plant regeneration 
have been achieved in the monocots as follows: 

10 asparagus (Asparagus officinalis; Bytebier et al . 

(1987) Proc. Natl. Acad. Sci . USA 84: 5345); barley 
{Hordeum vulgar ae; Wan and Lemaux (1994) Plant Physiol. 
104: 37); maize (Zea mays; Rhodes et al . (1988) Science 
240: 204; Gordon-Kamm et al . (1990) Plant Cell 2: 603; 

15 Fromm et al . (1990) Bio/Technology 8: 833; Koziel et 

al . (1993) -Bio/Technologry n : 194); oats (Avena sativa; 
Somers et al . (1992) Bio/Technology 10: 1589); 
orchardgrass (Dactylis glomerata; Horn et al . (1988) 
Plant Cell Rep. 7: 469); rice [Oryza sativa, including 

20 indica and japonica varieties; Toriyama et al . (1988) 
Bio/Technology 6: 10; Zhang et al . (1988) Plant Cell 
Rep. 7: 379; Luo and Wu (1988) Plant Mol . Biol. Rep. 6: 
165; Zhang and Wu (1988) Theor. Appl . Genet. 76: 835; 
Christou et al . (1991) Bio /Technology 9: 957); rye 

25 (Secale cereale; De la Pena et al . (1987) Nature 325: 
274); sorghum {Sorghum bicolor; Cassas et al . (1993) 
Proc. Natl. Acad. Sci. USA 90: 11212); sugar cane 
(Saccharum spp . ; Bower and Birch (1992) Plant J. 2: 
409); tall fescue (Festuca arundinacea; Wang et al . 

30 (1992) Bio/Technology 10: 691); turfgrass (Agrostis 

palustris; Zhong et al . (1993) Plant Cell Rep. 13: 1) ; 
and wheat (Triticum aestivum; Vasil et al . (1992) 
Bio/Technology 10: 667; Weeks et al . (1993) Plant 
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Physiol. 102: 1077; Becker et al . (1994) Plant J. 5: 
299) . 

Plant transformation vectors capable of delivering 
DNAs (genomic DNAs , plasmid DNAs , cDNAs, or synthetic 
5 DNAs) encoding plant -derived or other enzymes that 
affect the biosynthesis and accumulation of sterol 
compounds and tocopherols in plants for optimizing the 
pools of sitosterol, sitostanol, esters of either, and 
tocopherols, and for reducing the levels of 

10 campesterol, campestanol, and/or their esters, can be 
easily designed by art -recognized methods. Various 
strategies can be employed to introduce these encoding 
DNAs into plants to produce transgenic plants that 
biosynthesize and accumulate desirable levels of 

15 various sterol compounds and tocopherols, including: 

1. Transforming individual plants with an 
encoding DNA of interest. Two or more transgenic 
plants, each containing one of these DNAs, can then be 
grown and cross-pollinated so as to produce hybrid 

20 plants containing the two DNAs. The hybrid can then be 
crossed with the remaining transgenic plants in order 
to obtain a hybrid plant containing all DNAs of 
interest within its genome. 

2 . Sequentially transforming plants with plasmids 
25 containing each of the encoding DNAs of interest, 

respectively. 

3 . Simultaneously cotransf orming plants with 
plasmids containing each of the encoding DNAs, 
respectively . 

30 4 . Transforming plants with a single plasmid 

containing two or more encoding DNAs of interest . 

5. Transforming plants by a combination of any of 
the foregoing techniques in order to obtain a plant 
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that expresses a desired combination of encoding DNAs 
of interest . 

Traditional breeding of transformed plants 
produced according to any one of the foregoing methods 

5 by successive rounds of crossing can then be carried 
out to incorporate all the desired encoding DNAs in a 
single homozygous plant line (Nawrath et al . (1994) 
Proc. Natl. Acad. Sci . USA 91: 12760; PCT International 
Publication WO 93/02187) , or to produce hybrid 

10 offspring. 

In methods 2 and 3, the use of vectors containing 
different selectable marker genes to facilitate 
selection of plants containing two or more different 
encoding DNAs is advantageous. Examples of useful 

15 selectable marker genes include those conferring 

resistance to kanamycin, hygromycin, sulphonamides , 
glyphosate, bialaphos, and phosphinothricin . 



C . Processes of Regeneration 
20 Processes of regeneration of plants from 

transformed protoplasts are known in the art. 



D . Transgenic Plants and Progeny 

The present invention contemplates the plants that 
25 contain the exogenous constructs according to the 

present invention, such that a plant comprises at least 
one transformed plant cell comprising a nucleic acid 
construct. The nucleic acid construct, as described 
above has as operably linked components in the 5 ' to 3 ' 
30 direction, a promoter, a DNA sequence encoding a 
polypeptide exhibiting 3 -hydroxy-3 -methylglutaryl - 
Coenzyme A reductase enzyme activity, and a 
transcription termination signal sequence. The plant 
also comprises a nucleic acid construct that has as 
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operably linked components in the 5' to 3 ' direction, a 
promoter, a DNA sequence encoding a steroid pathway 
enzyme, and a transcription termination signal 
sequence. The steroid pathway enzyme is a squalene 
5 epoxidase enzyme, a sterol methyl transferase I enzyme, 
a sterol C4-demethylase enzyme, a obtusifoliol C14cc- 
demethylase enzyme, a sterol C5-desaturase enzyme, or a 
sterol methyl transferase II enzyme. In one 
embodiment, the nucleic acid constructs are recombinant 

10 constructs. 

In one embodiment of the present invention a 
transgenic plant can be produced in accordance with the 
processes discussed elsewhere herein. One method to 
arrive at the above construct -containing plant is to 

15 transform the plant cell with a recombinant vector 
harboring such a construct . Other methods involve 
direct transfer of the exogenous construct into the 
plant cell. The methods of arriving at a plant cell 
having exogenous nucleic acids are well known in art 

20 and are applicable to the present invention. In one 

embodiment, the nucleic acid constructs are recombinant 
constructs. In a preferred embodiment, the recombinant 
vector is a plant expression vector. 

The present invention contemplates a plant, the 

25 genome of which comprises introduced DNA. That 
introduced DNA has at least two components. One 
component is a DNA encoding a 3-hydroxy-3- 
methyulglutaryl -Coenzyme A reductase enzyme. The other 
component is DNA encoding a steroid pathway enzyme that 

30 is a squalene epoxidase enzyme, a sterol methyl 

transferase I enzyme, a sterol C4 -demethylase enzyme, a 
obtusifoliol C14a-demethylase enzyme, a sterol C5- 
desaturase enzyme, or a sterol methyl transferase II 
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enzyme. The storage organs, preferably seeds, of such 
a plant contain an elevated level of total accumulated 
sterol, compared to storage organs of an otherwise 
identical plant, the genome of which does not comprise 
5 the introduced DNA. The introduced DNAs are 

operatively linked to regulatory signals, preferably 
that cause seed-specific expression of said introduced 
DNAs . The seeds of such a plant contain a reduced 
level of squalene, cycloartenol , 24 -methylene 

10 cycloartenol, obtusif oliol , stigmasta-7-enol , or 
campesterol compared to the seeds of an otherwise 
identical plant or compared to a plant comprising an 
introduced DNA encoding a HMG CoA reductase enzyme 
without the contemplated steroid pathway enzyme. 

15 Also contemplated is a plant with introduced DNA, 

as described above, that produces seed having an 
elevated level of a steroid pathway product, compared 
to a corresponding transgenic or non- transgenic plant 
that does not contain said introduced DNA. 

20 The invention also contemplates a plant comprising 

introduced DNA encoding (i) an HMGR enzyme and (ii) a 
squalene epoxidase enzyme, a sterol methyl transferase 
I enzyme, a sterol C4 -demethylase enzyme, a 
obtusif oliol C14a-demethylase enzyme, a sterol C5- 

25 desaturase enzyme, a sterol methyl transferase II 

enzyme, or mixtures thereof, wherein said plant that 
produces a storage organ (preferably a seed) having an 
elevated level of a sterol pathway product compared to 
a corresponding transgenic or non- transgenic plant that 

30 does not contain said introduced DNA. 

The invention also contemplates a plant having 
introduced DNA, as described above, that produces a 
storage organ (preferably a seed) having a reduced 
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level of squalene, cycloartenol , 24 -methylene 
cycloartenol , obtusif oliol , stigmasta-7-enol , 
campesterol, or mixtures thereof, compared to a 
corresponding transgenic plant that comprises 

5 introduced DNA encoding an HMGR enzyme but that does 
not contain introduced DNA encoding a squalene 
epoxidase enzyme, a sterol methyl transferase I enzyme, 
a sterol C4 -demethylase enzyme, a obtusif oliol C14cc- 
demethylase enzyme, a sterol C5-desaturase enzyme, a 

10 sterol methyl transferase II enzyme, or mixtures 
thereof . 

For any of the above plants, an embodiment is 
contemplated wherein the introduced nucleic acid has 
regulatory signals that cause seed-specific expression 

15 of said introduced DNAs . 

The progeny of the above-described plants are also 
considered an embodiment of the present invention, as 
are plant cells or transformed plant cells. Cultures 
of those plant cells are also contemplated. Plants 

20 produced from seeds having introduced DNA are also 
embodiments of the present invention. 

A further embodiment of the present invention is a 
method of producing a plant that accumulates an 
elevated level of sterol pathway products, in seed of 

25 said plant compared to seed of a corresponding plant 
comprising no introduced DNA encoding a peptide, 
polypeptide, or protein that affects the biosynthesis 
and accumulation of a sterol pathway product, 
comprising sexually crossing a plant having introduced 

30 nucleic acid with the corresponding plant having no 
introduced DNA. Plants, including apomictic plants 
produced by this method are contemplated. 
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Another embodiment is a seed resulting from a 
cross of a plant having introduced DNA, described 
above, with a nurse cultivar. Also contemplated are 
seeds of any of the above-described plants. Also part 
5 of the present invention plant parts, other than a seed 
of any of the above-described plants. 

Uniform populations of the above-described plants 
are also contemplated. 

10 E . Stability of Transqene Expression 

As several overexpressed enzymes may be required 
to produce optimal levels of substrates for the 
biosynthesis of sterol compounds and tocopherols, the 
phenomenon of co- suppression may influence transgene 

15 expression in transformed plants. Several strategies 
can be employed to avoid this potential problem 
(Finnegan and McElroy (1994) Bio/Technology 12: 883) . 

One commonly employed approach is to select and/or 
screen for transgenic plants that contain a single 

20 intact copy of the transgene or other encoding DNA 
(Assaad et al . (1993) Plant Mol . Biol. 22: 1067; 
Vaucheret (1993) C.R. Acad. Sci. Paris, Science de la 
vie/Life Sciences 316: 1471; McElroy and Brettell 
(1994) TIBTECH 12: 62). Agrobacteri um-mediated 

25 transformation technologies are preferred in this 
regard . 

Inclusion of nuclear scaffold or matrix attachment 
regions (MAR) flanking a transgene has been shown to 
increase the level and reduce the variability 
30 associated with transgene expression in plants (Stief 
et al. (1989) ffature 341: 343; Breyne et al . (1992) 
Plant Cell 4: 463; Allen et al . (1993) Plant Cell 5: 
603); Mlynarova et al . (1994) Plant Cell 6: 417; Spiker 
and Thompson (1996) Plant Physiol. 110: 15). Flanking a 
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transgene or other encoding DNA with MAR elements may 
overcome problems associated with differential base 
composition between such transgenes or encoding DNAs 
and integrations sites, and/or the detrimental effects 

5 of sequences adjacent to transgene integration sites. 
The use of enhancers from tissue-specific or 
developmentally-regulated genes may ensure that 
expression of a linked transgene or other encoding DNA 
occurs in the appropriately regulated manner. 

10 The use of different combinations of promoters, 

plastid targeting sequences, and selectable markers for 
introduced transgenes or other encoding DNAs can avoid 
potential problems due to trans- inactivat ion in cases 
where pyramiding of different transgenes within a 

15 single plant is desired. 

Finally, inactivation by co- suppression can be 
avoided by screening a number of independent transgenic 
plants to identify those that consistently overexpress 
particular introduced encoding DNAs (Register et al . 

20 (1994) Plant Mol. Biol. 25: 951). Site-specif ic 

recombination in which the endogenous copy of a gene is 
replaced by the same gene, but with altered expression 
characteristics, should obviate this problem (Yoder and 
Goldsbrough (1994) Bio/Technology 12: 263). 

25 Any of the foregoing methods, alone or in 

combination, can be employed in order to insure the 
stability of transgene expression in transgenic plants 
of the present invention. 

30 F . Hybrid Plants 

The invention contemplates a plant having 
introduced DNA encoding an HMGR and at least one of the 
six steroid pathway enzymes, as described in detail 
above. It is contemplated that a transgenic plant 
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having DNA encoding an HMGR, as is known in the art, 
might be crossed with a transgenic plant having DNA 
encoding at least one of the six steroid pathway 
enzymes . 

5 Also contemplated as a hybrid plant according to 

the invention is a plant that is a hybrid of a 
transgenic plant having introduced DNA encoding an HMGR 
and at least one of the six steroid pathway enzymes 
wherein the plant has been hybridized with another 

10 strain, yet still retains the introduced DNA. 

G . Storage Organs 

The term "storage organ" as used herein, refers to 
the seeds, fruits or vegetable parts of a plant. Most 

15 often the seed is important for use in the present 

invention. However, there are consumable embodiments, 
such as with potatoes or carrots, where the vegetable 
parts are preferred. 

A contemplated embodiment of the present invention 

20 is a storage organ comprising at least one transformed 
host cell. The transformed host cell has at a minimum 
a construct according the invention as described above. 
Also contemplated are the embodiments when the 
construct has plant promoters, when the construct is 

25 recombinant, when the construct is part of a vector, 
and when the vector is a plant expression vector. 

The invention contemplates a transgenic plant seed 
transformed with a vector comprising a DNA segment that 
encodes a polypeptide having HMG-CoA reductase 

30 activity, and a DNA segment that encodes a polypeptide 
having a steroid pathway enzyme, wherein the transgenic 
plant seed is capable of germinating into a transgenic 
plant that over -accumulates steroid pathway products 
relative to a native, non-transgenic plant of the same 
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strain; and mutants, recombinants, genetically 
engineered derivatives thereof and hybrids derived 
therefrom, wherein said mutants, recombinants, 
genetically engineer derivatives thereof and hybrids 

5 derived therefrom maintain the ability to 
overaccumulate steroid pathway products. 

Seed from a transgenic plant is grown in the field 
or greenhouse, and resulting sexually mature transgenic 
plants are self -pollinated to generate true breeding 

10 plants. The progeny from these plants become true 

breeding lines that are evaluated for steroid compound 
or squalene accumulation, preferably in the field under 
a range of environmental conditions. 

The commercial value of a transgenic plant with 

15 increased steroid compound or squalene accumulation is 
enhanced if many different hybrid combinations are 
available for sale. The user typically grows more than 
one kind of hybrid based on such differences as time to 
maturity, standability or other agronomic traits. 

20 Additionally, hybrids adapted to one part of a country 
are not necessarily adapted to another part because of 
differences in such traits as maturity, disease and 
herbicide resistance. Because of this, steroid 
compound or squalene accumulation is preferably bred 

25 into a large number of parental lines so that many 
hybrid combinations can be produced. 

Adding an enhanced steroid compound or squalene 
accumulation trait to an agronomically elite line is 
accomplished by a variety of techniques well known to 

30 those of skill in the art. For example, parent 

transgenic plants that are either homozygous or contain 
a single independent segregatable gene that encodes a 
polypeptide having HMG-CoA activity and thus for 
enhanced sterol or squalene accumulation are crossed 
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with lines having other desirable traits such as 
herbicide resistance (U.S. Pat. No. 4,761,373) produce 
hybrids. Preferably, transgenic plants homozygous for 
enhanced sterol or squalene accumulation are used to 

5 generate hybrids . 

For example, a transgenic plant homozygous for 
enhance sterol accumulation is crossed with a parent 
plant having other desired traits. The progeny, which 
are heterozygous or independently segregatable for 

10 enhanced sterol accumulation and their other desired 
traits. The backcrossing of progeny with the parent 
may have to be repeated more than once to obtain a 
transgenic plant that possesses all desirable traits. 

Alternatively, transgenic plants with an enhanced 

15 sterol or squalene accumulation trait are made multiply 
transgenic by introducing into such plants other genes 
that encode and express other desirable traits, or are 
mutated as with radiation, e.g. X-rays or gamma rays, 
as in U.S. Pat. No. 4,616,099, whose disclosures are 

20 incorporated by reference. Thus, the present invention 
also contemplates mutants and genetically engineered 
derivatives of transgenic plants having enhanced sterol 
or squalene accumulation. 
VI . Harvest 

25 Besides seed, elevated levels of sterols, 

phytosterols, such as sitosterol, phytostanols , such as 
sitostanol, and esters thereof, can be found in other 
parts of the plants encompassed herein. While the 
seed-specific promoters contemplated in the present 

30 invention function preferentially in seed tissues, 
expression in other plant parts can be expected, 
depending upon the specificity of the particular 
promoter. In this case, promoters functional in plant 
plastids are less desirable than those primarily 
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directing expression in the cellular cytosol, though it 
may be desirable to use promoters to drive expression 
of the recombinant constructs or expression cassettes 
disclosed herein in tissues and organs other than 

5 seeds. For example, elevated levels of sterols, 

phytosterols , etc., can be expected in fruits, as well 
as vegetable parts of plants other than seeds. 
Vegetable parts of plants include, for example, pollen, 
inflorescences, terminal buds, lateral buds, stems, 

10 leaves, tubers, and roots. Thus, the present invention 
also encompasses these and other parts of the plants 
disclosed herein that contain elevated levels of 
desirable phytosterol, and phytostanol . 

Of course, a significant effect of introducing 

15 into plants the coding sequences disclosed herein will 
be on the content of phytosterols/phytostanols and 
their esters of seed oil. Therefore, additional 
aspects of the present invention include oil obtainable 
from the seed of the plants described herein, and 

20 methods for producing such plants and oil. Methods for 
extracting and processing seed oils are well known in 
the art . 

Oils produced by the cells, plants, and methods 
disclosed herein are superior in 

25 phytosterol/phytostanol composition to conventional 
oils in a variety of ways. Oil of the present 
invention can contain an elevated level of at least one 
sterol, at least one phytosterol, at least one 
phytosterol ester, at least one phytostanol, at least 

30 one phytostanol ester, or mixtures thereof. Preferred 
compounds include sitosterol, sitostanol, and their 
esters . 

Oil from seed of plants containing and expressing 
introduced DNA encoding a sterol methyltransf erase II 
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enzyme advantageously contains a reduced level of 
campe sterol , at least one campesterol ester, 
campestanol , at least one campestanol ester, or 
mixtures thereof. The sterol methyltransf erase II- 

5 encoding DNA can be introduced alone, or in combination 
with other introduced DNA sequences encoding enzymes 
affecting the biosynthesis of steroid compounds as 
discussed herein. Campesterol/campestanol and their 
esters are considered to be undesirable because they 

10 are readily absorbed in the intestine, while their 
safety in the blood is unknown. Employing the plants 
and methods disclosed herein, one can obtain seed oil 
comprising about 0% to about 19%, preferably about 0% 
to about 12%, more preferably about 5% to about 9% 

15 campesterol, at least one campesterol ester, 

campestanol, at least one campestanol ester, or 
mixtures thereof by weight of the total sterol 
compounds of the oil. (The levels of these compounds 
are difficult to express on a percent seed dry weight 

20 basis because different seeds contain different 

percentages of these compounds expressed on this basis) 
These values represent a reduction of about 10% to 
about 10 0% in the amount of these compounds compared to 
those in conventional oils. 

25 Introduction into plant cells of the enzyme- 

encoding DNA sequences discussed above modifies the 
biosynthesis of sterol compounds carried out by the 
methods, and in the cells, plants, and seeds, disclosed 
herein. In particular, the expression of an HMG CoA 

30 reductase in conjunction with DNA sequences for a 
steroid pathway enzyme is expected to result in 
alteration of the steroid pathway product profiles in 
oil as the enhanced steroid pathway throughput produces 
substrates for the enhanced enzyme activity. The novel 
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phytostanol ester compositions, e.g., sitostanol ester 
compositions, thus produced constitute another aspect 
of the present invention. 

5 A. Harvest of Steroid Compounds 

Methods for the derivation of steroid compounds 
from cells are well known in the art. The invention 
contemplates the recovery of biosynthesized steroid 
compounds from the leaves and/or stems of plants, from 

10 plant seeds, from plant's vegetative organs, from 

callouses, and from cell cultures of plants, yeasts or 
eukaryotic cells. 

Different sources of steroid compounds are 
preferred for various plants. For use as a food or a 

15 food component as discussed later, the steroid 

compounds need not be isolated or purified to 100 
percent purity. Steroid compound- enriched plants may 
be utilized directly. 

For example, from tobacco or Arabidopsis , it may 

20 be preferable to extract a pulp of the leaves and 
stems. From tomato, potato, or corn, it may be 
preferable to use the tomato, potato or corn in the 
form of the familiar storage organs that are typically 
consumed either directly, or a derivative thereof, such 

25 as tomato paste, potato flakes, vegetable oil and many 
more that are well known in the food science arts. 

If desired, after cultivation, the transgenic 
plant is harvested to recover the sterol or squalene 
product. This harvesting step can consist of 

30 harvesting a callus culture, the entire plant, or only 
the leaves, or roots of the plant. This step can 
either kill the plant or, if only a non-essential 
portion of the transgenic plant is harvested, can 
permit the remainder of the plant to continue to grow. 
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In preferred embodiments, this harvesting step 
further comprises the steps of: 



(i) homogenizing at least a sterol - 
5 containing or a squalene-containing 

portion of the transgenic plant to 
produce a plant pulp and using the 
sterol -or squalene-containing pulp 
directly, as in dried pellets or tablets 
10 as where an animal food is contemplated; 

or 



(ii) extracting the squalene or 
sterol (s) from the plant pulp with an 

15 appropriate solvent such as an organic 

solvent or by supercritical extraction 
[Favati, et al . , J. Food Sci . 53:1532 
(1988) and the citations therein] to 
produce a sterol -or squalene-containing 

20 liquid solution or suspension; and 



(iii) isolating the squalene or 
sterol (s) from the solution or 
suspension . 

25 

At least a portion of the transgenic plant is 
homogenized to produce a plant pulp using methods well 
known to one skilled in the art. This homogenizat ion 
can be done manually, by a machine, or by a chemical 
30 means as long as the transgenic plant portions are 
broken up into small pieces to produce a plant pulp. 
This plant pulp consists of a mixture of squalene or 
the sterol of interest, residual amounts of precursors, 
cellular particles and cytosol contents. This pulp can 
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be dried and compressed into pellets or tablets and 
eaten or otherwise used to derive the benefits, or the 
pulp can be subjected to extraction procedures. 

The sterol or squalene can be extracted from the 

5 plant pulp produced above to form a sterol -or- squalene- 
containing solution or suspension. Such extraction 
processes are common and well known to one skilled in 
this art. For example, the extracting step can consist 
of soaking or immersing the plant pulp in a suitable 

10 solvent. This suitable solvent is capable of 

dissolving or suspending the squalene or sterol present 
in the plant pulp to produce a sterol -or squalene- 
containing solution or suspension. Solvents useful for 
such an extraction process are well known to those 

15 skilled in the art and included several organic 

solvents and combinations thereof such as methanol, 
ethanol, isopropanol, acetone, acetonitrile, 
tetrahydrofuran (THF) , hexane, and chloroform as well 
as water-organic solvent mixtures. A vegetable oil 

20 such as peanut, corn, soybean and similar oils can also 
be used for this extraction as can steam distillation. 

A whole plant or callus culture with an added, 
exogenous structural gene for a polypeptide having HMG- 
CoA reductase activity is grown under suitable 

25 conditions for a period of time sufficient for squalene 
or sterols to be synthesized and accumulated. The 
sterol - squalene -containing plant cells, preferably in 
dried form, are then lysed chemically or mechanically, 
and the squalene or sterol is extracted from the lysed 

30 cells using a liquid organic solvent or steam 

distillation, as described before, to form a sterol- or 
squalene -containing liquid solution or suspension by 
usual means such as chromatography. 
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The squalene or sterol is isolated from the 
solution or suspension produced above using methods 
that are well known to those skilled in the art of 
squalene and sterol isolation. These methods include, 
but are not limited to, purification procedures based 
on solubility in various liquid media, chromatographic 
techniques such as column chromatography and the like. 

The invention contemplates a sitosterol or 
sitostanol ester composition extracted from the seed of 
a transgenic plant of the invention. The invention 
also contemplates such a sitosterol or sitostanol ester 
wherein an esterifying fatty acid has 2 to 22 carbon 
atoms in the main chain. 

B . Harvest of Oil 

The novel biosynthetic composition of the oil in 
the transgenic plants is contemplated. Thus, the 
present invention contemplates oil containing at least 
one sterol pathway product, extracted from seed of a 
described transgenic plant. Preferably, sitosterol, at 
least one sitosterol ester, or mixtures thereof, 
comprise at least about 50% by weight of the total 
sterol compounds of the oil. Preferably, sitosterol, 
at least one sitosterol ester, or mixtures thereof, 
comprise at least about 0.08% of the dry weight of a 
contemplated seed. Preferably, the oil has a reduced 
amount of squalene, cycloartenol , 24 -methylene 
cycloartenol , obtusif oliol , stigmasta-7-enol , 
campesterol, or mixtures thereof, compared to oil from 
a corresponding transgenic plant that does not contain 
introduced DNA encoding a squalene epoxidase enzyme, a 
sterol methyl transferase I enzyme, a sterol C4- 
demethylase enzyme, a obtusif oliol C14a-demethylase 
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enzyme, a sterol C5 -desaturase enzyme, a sterol methyl 
transferase II enzyme, or mixture thereof, and that 
reduction is in the range of from about 10% to about 
100% . 

Oil is extracted from transgenic plant seeds of 
the present invention by method well known in the art. 
By way of example, oil can be extracted from plant 
seeds using extraction methods set forth above for 
harvesting sterols and squalene from transgenic plants. 
Alternatively, oil can be extracted from transgenic 
plant seeds by usually used methods for obtaining seed 
oils such as by crushing he seeds to produced a pulp 
and then pressing the pulp to obtain oil . The pulp can 
also be extracted with appropriate solvents (e.g. 
benzene) to obtain the oil. Industrial Chemistry: A 
Manual for the Student and Manufacturer, ed. By A. 
Rogers and A. B. Aubert, D. Van Nostrand Co., New York, 
pages 547-548 (1912) . 

C. Uses of Oil 

As discussed in the "Description of Related Art," 
phytostanols such as sitostanol are beneficial for 
lowering serum cholesterol (Ling et al . (1995) Life 
Sciences 57: 195-2 06) and preventing cardiac disease. 
Tocopherols act as antioxidants, and play a major role 
in protecting cells from damage caused by free radicals 
(Halliwell (1997) Nutrition Review 55: 44-60). As the 
amount of sitostanol in conventional vegetable and bran 
oils is low relative to that of other sterol compounds, 
the oils of the present invention are particularly 
useful for reducing the concentration of low density 
lipoprotein cholesterol in plasma. 

Thus, further aspects of the present invention 
include the following: 
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Cholesterol -lowering compositions comprising the 
oils and sitostanol ester compositions disclosed 
herein. Such cholesterol -lowering compositions can 
take the form of, or be used in, foods, food products, 

5 processed foods, food ingredients, food additive 

compositions, or dietary supplements that contain oils 
and/or fats. Non-limiting examples include margarines; 
butters; shortenings; cooking oils; frying oils; 
dressings, such as salad dressings; spreads; 

10 mayonnaises; and vitamin/mineral supplements. Patent 
documents relating to such compositions include U.S. 
Patents 4,588,717 and 5,244,887, and PCT International 
Publication Nos . WO 96/38047, WO 97/42830, WO 98/06405, 
and WO 98/06714. Additional non-limiting examples 

15 include toppings; dairy products such as cheese and 
processed cheese; processed meat; pastas; sauces; 
cereals; desserts, including frozen and shelf -stable 
desserts; dips; chips; baked goods; pastries; cookies; 
snack bars; confections; chocolates; beverages; 

20 unextracted seed; and unextracted seed that has been 
ground, cracked, milled, rolled, extruded, pelleted, 
defatted, dehydrated, or otherwise processed, but which 
still contains the oils, etc., disclosed herein. 
Food additive compositions of the present 

25 invention can be made by a method comprising obtaining 
oil containing a phytostanol or phytostanol ester 
selected from sitostanol, at least one sitostanol 
ester, or mixtures thereof, from cultured cells, or 
seeds of a plant, of the present invention, and evenly 

30 distributing the oil or desired phytostanol compound 
in finely divided form throughout the food product or 
food additive composition to which it is added by 
dissolution or by suspension in an emulsion. For 
example, the oil or phytostanol compound can be 
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dissolved in an edible solubilizing agent, or can be 
mixed with an edible solubilizing agent, an effective 
amount of a dispersant, and optionally, an effective 
amount of an antioxidant. Examples of useful edible 
solubilizing agents include, but are not limited to, 
monoglycerides, diglycerides , triglycerides, vegetable 
oils, tocopherols, alcohols, polyols, or mixtures 
thereof. Examples of useful antioxidants include, but 
are not limited to, tocopherols, such as -tocopherol, 
ascorbic acid, inexpensive synthetic antioxidants, and 
mixtures thereof. Effective carriers for preparing 
emulsions or suspensions include water, alcohols, 
polyols, other edible compounds in which the oil or 
phytostanol compound is soluble or insoluble, and 
mixtures thereof. Examples of useful dispersants 
include, but are not limited to, lecithin, other 
phospholipids, sodium lauryl sulfate, fatty acids, 
salts of fatty acids, fatty acid esters, other 
detergent -like molecules, and mixtures thereof. 
Alternatively, the food additive composition can be 
made by a method comprising obtaining oil containing at 
least one tocopherol, and a phytostanol or phytostanol 
ester selected from sitostanol, at least one sitostanol 
ester, and mixtures thereof, from cultured cells, or 
seed of a plant, of the present invention, and mixing 
the oil with an edible solubilizing agent and an 
effective amount of a dispersant. Again, the edible 
solubilizing agent can include, but is not limited to, 
monoglycerides, diglycerides, triglycerides, vegetable 
oils, tocopherols, alcohols, polyols, or mixtures 
thereof, and the dispersant can include, but is not 
limited to, lecithin, other phospholipids, sodium 
lauryl sulfate, fatty acids, salts of fatty acids, 
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fatty acid esters, other detergent -like molecules, and 
mixtures thereof . 

The cholesterol -lowering compositions can also 
take the form of pharmaceutical compositions comprising 
a cholesterol -lowering effective amount of the oils or 
sitostanol ester compositions disclosed herein, along 
with a pharmaceutically acceptable carrier, excipient, 
or diluent. These pharmaceutical compositions can be 
in the form of a liquid or a solid. Liquids can be 
solutions or suspensions; solids can be in the form of 
a powder, a granule, a pill, a tablet, a gel, or an 
extrudate. U.S. Patent 5,270,041 relates to sterol - 
containing pharmaceutical compositions. 

Any of the foregoing cholesterol -lowering 
compositions can be used alone or in combination in 
methods to lower the risk of developing an elevated 
plasma concentration of low density lipoprotein 
cholesterol, to lower the plasma concentration of low 
density lipoprotein cholesterol, or to treat or prevent 
an elevated plasma concentration of low density 
lipoprotein cholesterol. Such methods comprise orally 
administering to a human or animal subject an effective 
amount of cholesterol-lowering composition. What 
constitutes an effective amount of cholesterol -lowering 
composition can be determined empirically, and depends 
in part on a variety of factors, including the age, 
weight, sex, diet, general medical condition of the 
subject, and the severity of hypercholesterolemia. 
Subjects undergoing treatment with the cholesterol - 
lowering combinations disclosed herein can be monitored 
by routine measurement of serum cholesterol levels to 
determine the effectiveness of therapy. Continuous 
analysis of the data obtained in this way permits 
modification of the treatment regimen during therapy so 
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that optimal effective amounts of the cholesterol- 
lowering compositions of this invention are 
administered, and so that the duration of treatment can 
be determined as well. In this way, the treatment 
regimen/dosing schedule can be rationally modified over 
the course of treatment so as to achieve the lowest 
cholesterol -lowering effective amount of the present 
compositions which results in satisfactory anti- 
cholesterolemic effectiveness, and so that 
administration of these compositions is continued only 
so long as is necessary to successfully treat this 
condition. In general, an effective amount of a 
cholesterol -lowering composition of the present 
invention in the form of a phytostanol- or phytostanol 
ester-containing composition is in the range of from 
about 0.1 gm/day to about 4.5 gm/day. By way of 
example, a phytostanol ester composition, for example a 
sitostanol ester composition, can be administered in an 
amount in the range of from about 0.1 gm/day to about 
4.5 gm/day, preferably from about 1 gm/day to about 4.5 
gm/day, more preferably from about 2 gm/day to about 
4.5 gm/day. A phytostanol composition, for example a 
sitostanol composition, can be administered in an 
amount in the range of from about 0.1 gm/day to about 3 
gm/day, preferably from about 1 gm/day to about 3 
gm/day, more preferably from about 2 gm/day to about 3 
gm/ day . 

The cholesterol -lowering compositions of the 
present invention can be administered daily to patients 
in accordance with a number of different regimens. 
Fundamentally, these compositions should be 
administered in a cholesterol -lowering effective amount 
for a period of time effective to exert their anti- 
hypercholesterolemic preventing, reducing, or reversing 
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action. Administration of the present cholesterol- 
lowering compositions should be continued until the 
hypercholesterolemia condition has been controlled or 
eliminated . 

5 Another method encompassed by the present 

invention is that of achieving or improving effective 
absorption of sitostanol into a host, comprising 
producing at least one sitostanol ester by any of the 
methods disclosed herein, and administering this 

10 sitostanol ester to a host, which can be a human or 

animal. The sitostanol ester can be administered by a 
route selected from oral route, parenteral route, or 
topical route. The dose, which can be administered 
daily, can be up to about 10 milligrams of the 

15 sitostanol ester per kilogram of body weight. U.S. 
Patent 5,2 02,045 relates to the use of stanol fatty 
acid esters to reduce serum cholesterol . 

Also envisioned are plants which in addition to 
having increased levels of phytosterols and 

20 phytostanols due to the presence of constructs 
comprising sequences encoding 3-hydroxy-3- 
methylglutaryl- Coenzyme A reductase and at least one 
other sterol synthesis pathway enzyme, have increase 
levels of tocopherol due to the presence of constructs 

25 comprising sequences allowing the overexpression of 
enzymes in the tocopherol biosynthetic pathway. 

Tocopherol levels vary in different plants, 
tissues, and developmental stages, indicating a highly 
regulated biosynthetic pathway. The production of 

30 homogentisic acid by p-hydroxyphenylpyruvate 

dioxygenase is likely to be a regulatory point for bulk 
flow through the pathway because of irreversible enzyme 
action and because homogentisic acid production is the 
first committed step in tocopherol biosynthesis (Norris 
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et al., 1995, Plant Cell 7: 2139-2149). The other key 
regulatory step in tocopherol biosynthesis is the 
availability of the phytylpyrophosphate pool. Feeding 
studies (Furuya et al., 1987, Phytochem. , 26: 2741- 
2747) in saf flower callus culture demonstrated 1.8-fold 
and 18 -fold increases in tocopherol synthesis by 
feeding homogentisate and phytol, respectively. In 
meadow rescue leaf, vitamin E increases in the initial 
phase of foliar senescence when phytol is cleaved off 
from the chlorophylls and when free phytol is available 
(Peskier et al . , 1989, J. Plant Physiol. 135: 428-432). 
These reports suggest tight coupling of tocopherol 
biosynthesis to the availability of homogentisic acid 
and phytol . 

Transformation of plants with nucleic acid 
constructs that increase the biosynthetic activity of 
the tocopherol pathway can lead to increased production 
of particular tocopherol isomers, for example, ct- 
tocopherol, are known in the art and can be found, for 
example, in PCT International publication WO 00/61771 
which is incorporated herein by reference. Formation 
of a-tocopherol from other tocopherols occurs due to S- 
adenosylmethionine (SAM) -dependent methylases such as 
y-tocopherol methyl transferase. Overexpression of 
methyl transferases in combination with 3-hydroxy-3- 
methylglutaryl -Coenzyme A reductase and at least one 
other sterol synthesis pathway enzyme as described 
herein is also contemplated in the present methods. 
Thus, any of the DNAs encoding enzymes of the 
tocopherol biosynthetic pathway, discussed above, are 
useful in the present invention. Transformation of 
plants with an early tocopherol biosynthesis gene is 
sufficient to produce plants having an elevated level 
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of tocopherols. By "early tocopherol biosynthesis 
gene" is meant DNA encoding geranylgeranylpyrophosphate 
synthase, geranylgeranylpyrophosphate hydrogenase, 4- 
hydroxyphenylpyruvate dioxygenase, and phytyl/prenyl 
transferase. DNA encoding enzymes active in later 
steps of tocopherol biosynthesis {"secondary tocopherol 
biosynthesis genes") can be expressed to enhance carbon 
flux through the tocopherol pathway even further, and 
to produce specific tocopherol isomers. In this way, 
the tocopherol biosynthetic pathway can be modified to 
enhance production of any tocopherol compound of 
interest, such as cc-tocopherol . As noted above, a 
variety of sources are available for the early 
tocopherol biosynthesis genes (and other tocopherol 
biosynthesis genes) , and a gene from any of these 
sources can be utilized. If co-suppression occurs when 
a plant gene native to the target host plant is used to 
increase expression of a particular enzyme, a coding 
sequence from another source can be used as an 
alternative . 

Preferred genes for introduction into plants to 
alter tocopherol quant ity/ quality include 3-deoxy-D- 
arabino-heptulosonate-7-P synthase (DAHP synthase) , 
shikimate kinase, either or both of the prephenate 
dehydrogenases, 1-deoxy-d-xylulose 5 -phosphate 
synthetase (DXS) , 1-deoxy-d-xylulose 5-phosphate 
reductoisomerase (DXR) , 4-diphosphocytidyl-2C-methyl-d- 
erythritol synthase (YgbP) , 4 -diphosphocyt idyl -2C- 
methyl-d-erythritol kinase (YchB) , 2C-methyl-d- 
erythritol 2 , 4 -cyclodiphosphate synthase (YgbB) , the 
gene product of GcpE, LytB (Altincicek et al . , 2001, J. 
Bacterid. , 183:2411-2416; Altincicek et al . , 2001, J". 
Immunol., 166:3655-3658; Campos et al . , 2001, FEBS 



125 MTC 6783.1 

PATENT 

Lett., 488:170-173), geranylgeranylpyrophosphate 
synthase (GGPPS) , geranylgeranylpyrophosphate 
hydrogenase (GGH) , phytyl /prenyl transferase (PPT) , 4- 
hydroxy-phenylpyruvate dioxygenase (HPPD) , 2 -methyl -6- 
phytylplastoquinol tocopherol methyl transferase I 
(MTI) , tocopherol cyclase, y- tocopherol 
methyltransferase (GMT) a plant sir 1736 gene (see 
Cyanobase http://www,kazusa.or.jp/cyaiibase), a plant sir 1737 gene 
(see Cyanobase http://www.kazusa.or.jp/cyanbase), an ATPT2 gene 
(Smith et al . , Plant J., 11:83-92, 1977), and an AANT1 
gene (Saint Guily et al . , Plant Physiol., 100:1069- 
1071, 1992) . 

4 -hydroxy -phenyl pyruvate dioxygenase and 
geranylgeranylpyrophosphate hydrogenase will increase 
the homogentisate and phytol pools, respectively. 
Enzymes that control fluxes through pathways are well 
known to be regulated in higher organisms such as 
plants. Therefore, 4 -hydroxyphe nyl pyruvate dioxygenase 
and geranylgeranylpyrophosphate hydrogenase genes of 
microbial origin which are not subject to regulation in 
plants, or those from higher organisms (plants, algae, 
fungi, etc.) that are deregulated, are especially 
attractive in this regard. A non- limiting example is 
the microbial enzyme 4 -amino-4 -deoxyprephenate 
dehydrogenase (TyrA from Erwinia herbicola) which can 
replace prephenate aminotransferase, arogenate 
dehydrogenase and aminotransferase. Overexpression of 
enzymes such as 3-deoxy-arabino-heptulosonate 7-P 
(DAHP) synthase, prephenate dehydrogenase, and 
shikimate kinase would lead to increases in the levels 
of homogentisate. DNA encoding any of the tocopherol 
biosynthetic enzymes discussed herein can be introduced 
alone or in various combinations to enhance tocopherol 
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quantity and/or alter tocopherol quality. When 
introduction of multiple enzymes is desirable, 
preferred combinations include, but are not limited to, 
4-hydroxyphenylpyruvate dioxygenase (HPPD) plus 

5 geranylgeranylpyro-phosphate hydrogenase (GGH) , 

geranylgeranylpyrophosphate synthase (GGPP synthase) 
plus geranylgeranylpyrophosphate hydrogenase (GGH) , 4- 
amino-4-deoxyprephenate dehydrogenase (TyrA) plus 
phytylprenyltransf erase (PPT) , geranylgeranylpyro- 

10 phosphate hydrogenase (GGH) plus 
phytylprenyltransf erase (PPT) , 

geranylgeranylpyrophosphate synthase (GGPP synthase) 
plus phytylprenyltransferase (PPT), 2 -methyl-6- 
phytylplastoquinol tocopherol methyl transferase I (MTI) 

15 plus phytylprenyltransferase (PPT) , or 2 -methyl -6- 
phytylplastoquinol tocopherol methyltransf erase I 
(MTI) , phytylprenyltransferase (PPT) , 4- 
hydroxyphenylpyruvate dioxygenase (HPPD) and 
geranylgeranylpyrophosphate synthase (GGPP synthase) . 

20 Plants characterized by increase levels of sterol 

and tocopherol production can be produced by 
transforming plant cells or tissues genetically altered 
for increased sterol production by the methods 
described herein with additional nucleic acid 

25 constructs encoding tocopherol biosynthetic enzymes. 

Introduction of constructs encoding tocopherol pathway 
enzymes can be accomplished using standard methods in 
molecular biology such as those described herein or 
those described in PCT International Publication WO 

30 00/61771. Introduction of constructs encoding 3- 

hydroxy- 3 -methylglutaryl - Coenzyme A reductase, at least 
one other sterol synthesis pathway enzyme, and at least 
one tocopherol synthesis pathway enzyme can be 
accomplished in a single transformation or in a series 
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of transformations. For example, and without 
limitation, plant cells transformed with constructs 
encoding 3 -hydroxy- 3 -methylglutaryl -Coenzyme A 
reductase, at least one other sterol synthesis pathway 

5 enzyme as described herein could be selected and then 
further transformed with additional constructs encoding 
one or more tocopherol synthesis pathway enzymes and in 
particular S-adenosylmethionine-dependent y-tocopherol 
methyltransf erase enzyme. Successfully transformed 

10 cells can then be selected and used to regenerate 

plants having increased levels of phytosterols and/or 
phytostanols as well as increased levels of tocopherol. 
Plants produced can then be "selfed", a technique well 
known in the art, to produce uniform populations of 

15 plants. 

Alternatively, plants characterized by increased 
levels of tocopherols and phytosterols and/or 
phytostanols can be produced by traditional plant 
breeding methods. For example, plants transformed with 

20 nucleic acid constructs encoding 3-hydroxy-3- 

methylglutaryl -Coenzyme A reductase and at least one 
other sterol synthesis pathway enzyme can be sexually 
crossed with high tocopherol plants. Any plant 
transformed to produce increased levels of tocopherols 

25 and in particular a- tocopherols can be used. Non- 
limiting examples include plants produced by the 
methods described above and in PCT International 
publication WO 00/61771. 

If desired, the plants produced can be selfed to 
30 produce homozygous, uniform populations of plants. 

Seed obtained from the transgenic, progeny, 
hybrid, etc., plants disclosed herein can be used in 
methods for obtaining oil containing phytosterols, 



128 MTC 6783.1 

PATENT 

phytosterol esters, phytostanols , phytostanol esters, 
or mixtures thereof along with tocopherols employing 
extraction and processing procedures known in the art. 
Note, in this regard, Kochhar (1983) Prog. Lipid Res. 
22: 161-188. Alternatively, seeds with increased levels 
of tocopherols and phytosterols , phytosterol esters, 
phytostanols, phytostanol esters, or mixtures thereof; 
or fruits and vegetables with increased levels of 
tocopherols and phytosterols, phytosterol esters, 
phytostanols, phytostanol esters, or mixtures thereof, 
can be used directly. 

Tocopherols and phytosterols and/or phytostanols 
can then be obtained from deodorized distillates of oil 
seed extracts and in particular soybean oil extracts. 
Such deodorized distillates are expected to contain 
increased levels of both tocopherols and phytosterols 
and/or phytostanol extracts. Oil extracts from plants 
and seed of the present invention are particularly 
valuable in that they allow the production of high 
sterol/tocopherol oils in a single process thus 
resulting in reduced purification costs, processing 
time and waste stream. Methods for the isolation of 
tocopherols and sterols from plant oils are well known 
in the art and can be found, for example, in U.S. 
Patent Nos . 4,454,329; 5,097,012; 5,594,437; and 
5, 981, 781 . 

EXAMPLES 

The following examples are intended to provide 
illustrations of the application of the present 
invention. The following examples are not intended to 
completely define or otherwise limit the scope of the 
invention . 
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Example 1. Enhancement of Phytosterol content in 
seeds of transgenic plants by seed-specific 
overexpression of full-length HMG-CoA reductase (HMGR) 
In order to examine the ability of HMGR 

5 overexpression for increasing sterol compound levels in 
seeds, the following experiment was performed in 
Glycine max. A full-length HMGR gene from rubber 
genomic DNA was expressed in developing Glycine max 
seeds using the 7S promoter. This was achieved by 

10 excising the rubber HMGR gene from the plasmid pHEV15 
(Schaller et al . , (1995) Plant Physiol., 109: 761-770) 
using EcoRI . The 3.8 Kb fragment was inserted into the 
EcdRT site of pMON2 992 0 (Fig. 3) such that the HMGR 
gene is flanked by the 7S promoter on the 5' end and 

15 the E9 3' terminator to create pMON43800 (Fig. 4). This 
was next digested with Sail and Wot I to release a 7.7 
Kb fragment that was then blunt -ended at the Sal I end 
before ligating to pMON23616 (Fig. 5) that was first 
cut with Smal and JVotl. This created the pMON43818 

20 binary vector that contains the rubber HMGR gene driven 
by 7S promoter and the NPTII gene selection marker 
driven by the NOS promoter and 3' NOS terminator. 
PMON43818 (Fig. 6) was used to transform Agrobacterium 
tumefaciens and transform Glycine max cotyledon 

25 explants as described in Example 2 . 

Seeds from 15 transgenic plants and one 
nontransgenic control plant were harvested at maturity. 
Sterol extraction and analysis on ten individual seeds 
per plant were performed as described in Example 2 . 

30 Results are presented in Table 2 . 
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Table 2 : Sterol profile of transgenic soybean 
plants expressing rubber HMGR gene driven by 7s 
5 promoter. Event 1: control, events 2-16:15 transgenic 
plants generated by 15 independent events using 
Agrobacterium mediated transformation. 

Total sterols increased by 3.2- and 3.9- fold in 
the best performing plants (transgenic events 3 and 4) . 
10 These two events also showed the highest increases of 
individual sterols. Campesterol increased by 2.7-fold, 
sitosterol by 3. 4 -fold, sitostanol by 3.2-fold and 
other sterols by 6.5-fold in event 3 while stigmasterol 
increased by 2. 3 -fold in event 4. The other sterols, 
15 which account for the highest increase in total sterols 
were pathway intermediates that included squalene, 
cycloartenol , 24 -methylene cycloartenol , obtusif oliol , 
isofucosterol, and stigmasta-7-enol . These pathway 
intermediates normally form minor constituents in the 
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sterol composition of seeds. However, in the transgenic 
seeds, probably due to increased carbon flux through 
the pathway, they accumulate in significant amounts. 
This suggests additional control points for sterol 
biosynthesis in plants such as squalene epoxidase, C-24 
sterol methyltransf erase, and C-14 obtusifoliol 
demethylase . 

Example 2 : Enhance phytosterol biosynthesis in 
seeds of transgenic soybean plants by seed-specific 
expression of catalytic domain of HMG-CoA Reductase 
(HMGR) alone and in combination with sterol methyl 
transferase II (SMTII) 

In another embodiment of the present 
invention, the levels of sterol compounds, including 
sitosterol, sitostanol, campesterol, stigmasterol and 
at least one ester for each of the sterol compounds and 
mixture there of, can be elevated in plant seeds by 
overexpression of catalytic domain of plant-HMG-CoA 
reductases. One can transform a plant of interest 
using an expression cassette or vector comprising DNA 
encoding a polypeptide exhibiting 3 -hydroxy- 3- 
methylglutaryl -Coenzyme A reductase (HMG-CoA reductase 
or HMGR) activity. HMGR cDNAs from rubber have been 
successfully used to increase plant sterol levels in 
plant tissues (Schallet et al . (1995) Plant Physiol. 
109: 761-770). Full-length and truncated forms of HMGR 
CDNAs encoding full-length and catalytic domain of 
HMGR, respectively, from Arabidopsis have also been 
used to overproduce sterols in transgenic Arabidopsis 
plants (Gonzalez et al . (1997) Third Terpnet Meeting of 
the European Network on Plant Isoprenoids Abstracts, 
Abstract No. 33, page 33) . In the above examples 
however, the genes have not been specifically targeted 
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to increase sterol levels in seeds. Another approach 
to enhance the nutritionally beneficial 24 -ethyl 
sterols (sitosterol, sitostanol) and reduce the 
accumulation of 24 -methyl sterols (campesterol ) in 
seeds one can co-express two genes encoding the enzymes 
HMGR and sterol methyl transferse II (SMTII) , each 
under the control of seed- specif ic promoter. Here we 
present evidence for such approaches : sterol 
composition of transgenic soybean seeds haboring 
truncated form (catalytic domain of HMGR without 
linker) of Arabidopsis HMGR1 is presented in Figure 11 
and Table 3 . Sterol composition of transgenic soybean 
seeds haboring Arabidopsis HMGR1 (catalytic domain of 
HMGR without linker) and Arabidopsis SMTII is presented 
in Figure 12 and Table 4 . 

In order to examine whether overexpression of 
the catalytic domain of HMGR increases sterol levels in 
the seeds of transgenic soybean, the following 
experiment was performed in Glycine max. A truncated 
form of HMGR1 cDNA encoding only the catalytic domain 
of HMGR from Arabidopsis was expressed in developing 
seeds of Glycine max using the seed-specific 7S 
promoter. This was achieved by excising the cDNA 
fragment (HMGRlcd) encoding the HMGR1 catalytic domain 
from the plasmid pHMGRlcd (Dale et al . , (1995) Eur. J. 
Biochem. 233: 506-513) using Ndel and Smal enzymes 
resulting in the isoloation of a 1.9 Kb fragment. The 
Ndel overhang was filled-in and the 1 . 9 Kb fragment 
was blunt-end ligated to vector pMON43818 (Figure 6) , 
previously Xhol (Xhol overhang was filled-in) and Smal 
digested such that the HMGRlcd was flanked by the 7S 
promoter on the 5' end and the E9 3' terminator to 
create a recombinant vector pMON43 0 52 (Figure 7) . This 
was next digested with Xbal and blunt-ended and then 
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digested with NotI to release a 3.4 Kb fragment and 
ligated to pMON51850 (Figure 8) that was digeted with 
Smal and NotI. The ligation created a recombinant 
binary vector pMON43 057 (Figure 9) that contained the 
cDNA fragment encoding the catalytic domain of 
Arabidopsis HMGR1, driven by 7S promoter and E9 3' 
terminator and the NPTII selectable marker gene driven 
by the NOS promoter and 3' NOS terminator. The 
pMON43057 was used for Agrobacterium tumefaciens 
mediated transformation of Glycine max cotyledon 
explants. The pMON43058 (Figure 10) construct carrying 
both the catalytic domain of Arabidopsis HMGR1 and 
Arabidopsis SMTII, both driven by the 7S promoter, was 
also used for Agrobacterium temefaciens-mediated 
transformation of Glycine max in a similar manner 
descibed below. 

Explants for transformation were prepared as 
follows: sterilized seeds were germinated on 
germination medium under light at 2 8°C for 5-6 days. 
Germinated seeds were placed in the dark at 4°C for 24 
hours prior to excision. Seed coats were removed and 
hypocotyls of each seedling trimmed to a length of 0.5 
cm to 1.0 cm in length. The cotyledons were then split 
open such that the hypocotyl was split down in the 
middle. The primary leaves and apical region of each 
cotyledon was removed to expose the wounding region. 
Wounding was performed with 3-7 shallow, scalpel scores 
in line with the embryo axis, ensuring that the apical 
bud was damaged. Wounded explants were incubated in 
the culture of Agrobacterium tumefaciens containing 
pMON43057. Incubation was for 1 hour at room 
temperature. Innoculated explants were then 
transferred to a co-culture medium and placed under 
light at 23°C for 3-4 days. At this time explants were 
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transferred to shooting medium without kanamycin 
selection and placed in a 25°C light growth room for 4 
days . 

After 4 days on delay, explants were transferred 
5 to a 186 ppm kanamycin selection medium and placed in a 
25°C light growth room for 2 weeks. At the end of two 
weeks explants were transferred to 18 6 ppm Woody Plant 
medium and placed again in a 2 5°C light growth room for 
another 2 weeks. Cultures were transferred every 2 
10 weeks to fresh medium for approximately 18-21 weeks. 
At the 6 week transfer, the cotyledons and any dead 
material were removed from the explants, and the 
petiole was cut. At each subsequent 2 week transfer, 
the petiole was cut to expose fresh cells to the 
15 medium. 

Transgenic shoots that were approximately W in 
length, with 2 nodes, 1 open trifoliate and an active 
growing point were selected, cut and transferred to 
rooting medium. Once a good root system was developed, 

20 the plants were sent to the greenhouse to grow up in 
soils in pots. 

Seeds from the 15 transgenic plants and one 
nontransgenic control plant were harvested at maturity. 
Ten individual seeds from each plant were weighed and 

25 ground into fine powder using an electric grinder. A 
known amount of cholestane (usually 100(xg in lOOjal 
ethanol) was added to each approximately 50mg powder 
sample. Sterol compounds were hydrolyzed directly from 
the ground tissue by saponification with 2ml of 10% KOH 

30 in methanol by refluxing the material at 6 0°C for 3 0 
minutes. The refluxed samples were cooled to room 
temperature and filtered through glass wool. An equal 
volume of water was added to each filtrate, and the 
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nonsaponif iables were extracted by partitioning three 
times with equal volumes of hexane. The hexane phases 
were pooled and evaporated. The residues were 
resuspended in 1 ml of acetone, and quantatively 

5 transferred to glass GC vials that were immediately 
capped. Sterols were analyzed by Gas Chromatography - 
Flame Ionizing Detector using the following conditions: 
Inlet temperature of 22 0°C, detector temperature of 
320°C, and column oven temperature programmed from 220°C 

10 to 320°C with initial temperature for 1 minute and final 
temperature for 16 minutes and ramp rate of 8°/min. The 
column used was a glass capillary DB-5 column of 50 m 
length, 320 \im diameter, and a film thickness of 0.25 
jj,m. The carrier gas was helium at a flow rate of 

15 l.Oml/min. Results are presented in Table 3 and Table 
4 . 
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To fully characterize the sterol compounds present 
in the transgenic seeds, a representative sample was 
also analyzed by Gas Chromatograpy-Mass Spectrometry 
(GC-MS) for confirmation of the sterol compounds 
present. The GC-MS conditions were as follows: inlet 
temp. 2 5 0°C, detector 32 0°C, oven programmed from 18 0°C 
to 325°C with initial equilibration time of 1.0 min, 
ramping to 310°C at 4°/min at then 20°/min to 325°C. The 
column was a DB-5 capillary glass column similar to the 
one used for GC-FID. 

Majority of the transgenic lines harboring 
pMON43057 showed 3 to 5-fold increase in total sterols. 
The best performing transgenic lines, GM_A13342 and 
GM_A13634, showed 6.5- and 6.1-fold increase in total 
sterols, respectively. These lines showed 2- to 2.6- 
fold increase in sitosterol, 1 . 5 to 2. 2 -fold increase 
in sitostanol and no significant change in the 
campesterol levels. Hence the major proportion of the 
total sterol increase was accounted by the accumulation 
of pathway intermediates which include squalene, 
cycloartenol, 24 -methylene cycloartenol , obtusif oliol , 
isofucosterol , and stigmasta-7 -enol . The best 
performing transgenic lines, GM_A13342 and GM_A13634, 
showed 32.6- and 3 2. 2 -fold increase in pathway 
intermediates accumulation, respectively, as compared 
to the control. In all the transgenic lines harboring 
the pMON43057, 50-70% of the total increase was 
accounted by the increase in the pathway intermediates 
accumulation as compared to the control. The pathway 
intermediates include squalene, cycloartenol, 24- 
methylene cycloartenol, obtusif oliol , isofucosterol, 
and stigmasta-7 -enol . 

Six transgenic lines haboring pMON43 058 produced 
5.8- to 6 -fold increase in total sterols and the rest 
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of the 10 transgenic lines with the pMON43058 showed 3- 
to 5- fold increase in total sterols. The best 
performing transgenic lines showed about 2- to 3 -fold 
increase in sitosterol and 4.5- to 6- fold increase in 
5 sitostanol levels. However, the campesterol 

accumulation was reduced by 50% in these lines. This 
was due to overexpression of the Arabidopsis SMTII 
enzyme which enhances the carbon flux towards the 
synthesis of 24 -ethyl sterols thereby reducing the 
10 carbon flux through the pathway leading to the 

synthesis of 24 -methyl sterols. As seen in pMON43057 
transgenic lines, all of the transgenic lines harboring 
the pMON43058 also accumulated 50-60% of the total 
sterols in the form of pathway intermediates which are 
15 squalene, cycloartenol , 24 -methylene cycloartenol , 
obtusifoliol, isofucosterol, and st igmasta- 7 -enol . 
These pathway intermediates normally form minor 
constituents in the sterol composition of seeds. 
However, in the transgenic seeds, probably due to 
20 increased carbon flux through the pathway, they 
accumulate in significant amounts. The pathway 
intermediates accumulation is highly significant when 
the truncated from of HMGR is overexpressed as compared 
to the full length form of HMGR suggesting that the 
25 overexpression of the truncated form of HMGR creates 
even greater increase in carbon flux through the 
pathway. This provides further evidence for additional 
control points for sterol biosynthesis in plants such 
as squalene epoxidase, sterol methyltransf erase I, 
30 sterol C4-demethylase, obtusifoliol C14oc-demethylase , 

sterol C5-desaturase, and sterol methyl transferase II. 
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Example 3 . Enhancement of phytosterol biosynthesis 
in seeds of Arabidopsis transgenic plants by 
constitutive expression of different forms of 
Arabidopsis and rubber HMGR enzymes. 

5 

Arabidopsis transgenic plants were generated using 
Agrobacterium mediated transformation of constructs 
(pMON53733, pMON53734, pMON53735, pMON53736, pMON53737, 
pMON53738, pMON53739, pMON53740) carrying cDNA encoding 

10 different forms of Arabidopsis and rubber HMGR enzymes 
driven by CaMV enhanced 35S promoter (Figures 13-20) . 
The transformed Arabidopsis seeds carrying each of the 
above constructs were selected on kanamycin (50^ig/ml) 
medium to select for transf ormant s expressing the 

15 selectable marker, the NPTII gene driven by CaMV 35S 
promoter. Kanamycin resistant Arabidopsis transgenic 
plants were grown in green house for maturity and seeds 
were collected from each of the transgenic lines for 
sterol analysis. About 50 mg of seeds from each 

20 transgenic line were weighed, homogenized and used for 
saponification to extract sterols as described in 
Example 2 . 

Figures 21-26 describe the sterol analysis data 
25 obtained from the transgenic lines carrying each of the 
above constructs. Figure 27 shows the effect on 
different sterol end products and pathway intermediate 
accumulation when different forms of rubber HMGR cDNAs 
were expressed const itutively in transgenic Arabidopsis 
30 plants. When truncated rubber HMGR (with or without 
linker region) was overexpressed the total sterol 
accumulation in seeds increased by 2.9 to 3. 7 -fold as 
compared with the wild type control plants. The sterol 
end products such as campesterol and sitosterol showed 
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1.5 to 2 -fold increase in the lines expressing 
truncated form of rubber HMGR (with and without 
linker) . However the sitostanol end product 
accumulation in the transgenic lines harboring the 
5 truncated form of rubber HMGR (with and without linker) 
was enhanced by 2.8 to 7- fold. There is a significant 
accumulation of pathway intermediates such as 
cycloartenol and 24 -methylene cycloartenol in the seeds 
of the transgenic lines transformed with the truncated 
10 form of rubber HMGR (with and without linker region) . 
The wild type control plants used in the experiment do 
not accumulate both of the pathway intermediates. 

Example 4 . Comparison of Steroid Compounds from 

15 HMGR Constructs in a Yeast HMGR1 Knockout Mutant 

The effects on the sterol levels of the expression 
of various HMGR constructs expressed in a yeast HMGR1 
knockout mutant were compared. Constructs containing a 
nucleic acid encoding the full length HMGR polypeptides 

20 from Arabidopsis and rubber were compared to those 
encoding a truncated Arabidopsis or rubber HMGR 
polypeptide that were lacking both the membrane binding 
and linker region of HMGR. The control yeast cells were 
transformed with a similar construct lacking a 

25 polypeptide encoding any form of HMGR. 

Yeast cells transformed with Arabidopsis HMGR and 
rubber HMGR constructs accumulated approximately the 
same amounts of zymosterol and ergosterol , but more 
squalene than the control yeast . 

30 Transformed yeast cells having rubber HMGR 

constructs accumulated about the same amount of 
ergosterol, but about twice as much squalene and 
zymosterol than the control yeast. 
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Transformed yeast cells having Arabidopsis tHMGR 
constructs accumulated three times as much squalene, 
twice as much zymosterol, and about 3 0 percent more 
ergosterol than the control yeast . 
5 Transformed yeast cells having rubber tHMGR 

constructs accumulated three times as much squalene, 
four times as much zymosterol, and about 50 percent 
more ergosterol than the control yeast . 

The data are shown in a Figure 28, "Plant HMGR1 
10 Contructs in Yeast HMGR1 Knockout Mutant". 

Example 5 . Gene sequences for all genes listed in 
the application 

The sequences obtained from the NCBI public 
15 database are SEQ ID NO.: 1,2,3,20,21,22,23. These 

sequences are included in the appendix and denoted as 
follows : 

Appendix A= SEQ ID NO. 1, 
Appendix B= SEQ ID NO . 2 , 
20 Appendix C= SEQ ID NO. 3, 

Appendix D= SEQ ID NO. 20, 
Appendix E= SEQ ID NO. 21, 
Appendix F= SEQ ID NO. 22, 
Appendix G= SEQ ID NO. 23. 

25 

SEQ ID 1 = Arabidopsis squalene epoxidase protein 
sequence (Accession NO: AC004786) See Appendix A 

SEQ ID 2 = Arabidopsis squalene epoxidase 
(Accession NO: N64916) See Appendix B 
30 SEQ ID 3 = Arabidopsis squalene epoxidase 

(Accession NO: T44667) See Appendix C 



SEQ ID 4 = Arabidopsis squalene epoxidase (clone 
ID: ATA506263) nucleotide sequence 

35 
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GAATTCCCGGGTCGACCCACGCGTCCGCTTATAGATAAGGATATGGCCTT 
TACGAACGTTTGCCTATGGACGCTACTCGCCTTCATGCTGACTTGGACAGTGTTC 
TACGTCACAAACAGGGGGAAGAAGGCGACGCAGTTGGCGGATGCGGTGGTTGAAG 
AGCGAGAAGACGGTGCTACTGACGTTATCATCGTTGGGGCTGGAGTAGGCGGCTC 
5 GGCTCTCGCATATGCTCTTGCTAAGGACGGGCGTCGAGTCCATGTAATAGAGAGG 
GACCTGAGAGAACCAGAGAGAATCATGGGTGAGTTTATGCAACCAGGAGGACGAC 
TCATGCTCTCTAAGCTTGGTCTTGAAGATTGTTTGGAGGGAATAGATGCCCAAAA 
AGCCACGGGCATGACAGTTTATAAGGACGGAAAAGAAGCAGTCGCATCTTTTCCC 
GTGGACAACAACAATTTTCCTTTTGATCCTTCGGCTCGATCTTTTCACAATGGCC 

10 GATTCGTCCAACGATTGCGGCAAAAGGCTTCTTCTCTTCCCAATGTGCGCCTGGA 
AGAAGGAACGGTGAAGTCTTTGATAGAAGAAAAAGGAGTGATCAAAGGAGTGACA 
TACAAAAATAGCGCAGGCGAAGAAACAACAGCCTTGGCACCTCTCACTGTAGTAT 
GCGACGGTTGCTACTCAAACCTTCGCCGGTCTCTTAATGACAACAATGCGGAGGT 
TCTGTCATACCAAGTTGGTTTTATCTCAAAGAACTGTCAGCTTGAAGAACCCGAA 

15 AAGTTAAAGTTGATAATGTCTAAACCCTCCTTCACCATGTTGTATCAAATCAGCA 
GCACCGACGTTCGTTGTGTTTTTGAAGTTCTCCCCAACAACATTCCTTCTATTTC 
AAATGGTGAAATGGCTACTTTCGTGAAGAACACTATTGCTCCTCAGGTACCTTTA 
AAACTCCGCAAAATATTTTTGAAAGGGATTGATGAAGGAGAACATATAAAAGCCA 
TGCCAACAAAGAAGATGACAGCTACTTTGAGCGAGAAGAAAGGAGTGATTTTATT 

20 GGGAGATGCATTCAACATGCGTCATCCAGCAATCGCATCTGGAATGATGGTTTTA 
TTATCTGACATTCTCATTTTACGCCGTCTTCTCCAGCCATTAAGCAACCTTGGCA 
ATGCGCAAAAAATCTCACAAGTTATCAAGTCCTTTTATGATATCCGCAAGCCAAT 
GTCAGCGACAGTTAACACGTTAGGAAATGCATTCTCTCAAGTGCTAGTTGCATCG 
ACGGACGAAGCAAAAGAGGCAATGAGACAAGGTTGCTATGATTACCTCTCTAGTG 

25 GTGGGTTTCGCACGTCAGGGATGATGGCTTTGCTAGGCGGCATGAACCCTCGTCC 
GATCTCTCTCATCTATCATCTATGTGCTATCACTCTATCCTCCATTGGCCATCTA 
CTCTCTCCATTTCCCTCTCCCCTTGGCATTTGGCATAGCCTTCGACTTTTTGGTT 
TGGCTATGAAAATGTTGGTTCCCCATCTCAAGGCTGAAGGAGTTAGCCAAATGTT 
GTTTCCAGTCAACGCCGCCGCGTATAGCAAAAGCTATATGGCTGCAACGGCTCTT 

30 TAAAACACTGGTGCTTTAAACTGCAAAATATAACACATATATAAATCCCGAATCT 
TTGTGATTCTGCATATATTGTGTTCTACAATTATTCTCATATAAATGAAAATTGT 
TCTACGTAAAAGTAAAAAGAAGGAATTGTAATACTAATAAAACGAGTTTTTAATT 
CTGTTGAATGCTTGTGTATATTGGTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAGGGCGGCCGC 

35 

SEQ ID 5 = Arabidopsis squalene epoxidase (clone 
ID: ATA506263) amino acid translation 

EFPGRPTRPLIDKDMAFTNVCLWTLLAFMLTWTVFYVTNRGKKATQLADA 
40 WEEREDGATDVI IVGAGVGGSALAYALAKDGRRVHVIERDLREPERIMGEFMQP 
GGRLML S KLGLEDCLE G I DAQKATGMTVYKDGKEAVAS F P VDNNNF P FD P S AR S F 
HNGRFVQRLRQKASSLPNVRLEEGTVKSLIEEKGVIKGVTYKNSAGEETTALAPL 
TWCDGCYSNLRRSLNDNNAEVLSYQVGFISKNCQLEEPEKLKLIMSKPSFTMLY 
QISS TDVRCVFEVLPNNI P S I SNGEMATFVKNT I APQVPLKLRKI FLKG I DEGEH 
45 IKAMPTKKMTATLSEKKGVILLGDAFNMRHPAIASGMMVLLSDILILRRLLQPLS 
NLGNAQKI SQVI KSFYDIRKPMSATVNTLGNAFSQVLVASTDEAKEAMRQGCYDY 
LSSGGFRTSGMMALLGGMNPRPISLIYHLCAITLSSIGHLLSPFPSPLGIWHSLR 
LFGLAMKMLVPHLKAEGVSQMLFPVNAAAYSKSYMAATAL* 

50 SEQ ID 6 = Arabidopsis squalene epoxidase (clone 
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GAATTCCCGGGTCGACCCACGCGTCCGCGGACGCGTGGGATTGAGAACAA 
ATAGATTTGGTTATATATGGCTTTTACGCACGTTTGTTTATGGACGTTAGTCGCC 
5 TTCGTGCTGACGTGGACGGTGTTCTACCTTACCAACATGAAGAAGAAGGCGACGG 
ATTTGGCTGATACGGTGGCTGAGGATCAAAAAGACGGTGCTGCTGACGTCATTAT 
CGTCGGGGCTGGTGTAGGTGGTTCGGCTCTCGCATATGCTCTTGCTAAGGATGGG 
CGTCGAGTACATGTGATCGAGAGGGACATGAGAGAACCAGAAAGAATGATGGGTG 
AGTTTATGCAACCTGGCGGACGACTCATGCTTTCTAAACTTGGCCTTCAAGATTG 

10 CTTGGAAGACATAGATGCACAGAAAGCCACGGGTTTGGCAGTTTATAAAGATGGA 
AAAGAAGCAGACGCACCTTTTCCAGTGGATAACAACAATTTTTCTTATGAACCTT 
CTGCTCGATCTTTTCACAATGGCCGATTCGTCCAACAACTGCGTCGAAAGGCTTT 
TTCTCTTTCCAATGTGCGCCTGGAAGAAGGAACGGTGAAGTCTTTACTAGAAGAA 
AAAGGAGTGGTCAAAGGAGTGACATACAAGAATAAAGAAGGCGAAGAAACAACAG 

15 CCTTGGCACCTCTCACTGTGGTATGCGACGGTTGCTACTCAAACCTTCGTCGGTC 
TCTTAATGATGACAACAATGCTGAGATTATGTCGTACATAGTTGGTTACATCTCA 
AAGAATTGTCGGCTTGAAGAACCCGAAAAGCTACACTTGATATTGTCTAAACCAT 
CTTTCACCATGGTATACCAAATAAGCAGCACTGACGTTCGTTGTGGTTTTGAGGT 
TCTCCCCGAAAATTTTCCTTCTATTGCAAATGGTGAAATGTCTACTTTCATGAAG 

20 AATACTATAGTTCCTCAGGTACCTCCAAAACTCCGCAAAATATTTTTGAAAGGTA 
TAGATGAGGGAGCACACATAAAAGTGGTGCCGGCAAAGCGCATGACATCTACTTT 
AAGCAAGAAGAAAGGTGTGATTGTATTGGGAGATGCATTCAATATGCGTCATCCA 
GTTGTTGCATCTGGAATGATGGTTTTACTGTCGGACATTCTCATTCTACGCCGTC 
TTCTTCAGCCATTAAGCAACCTCGGCGATGCAAACAAAGTCTCAGAAGTTATCAA 

25 TTCCTTTTATGATATCCGCAAGCCAATGTCGGCGACGGTTAACACATTGGGAAAT 
GCATTTTCTCAAGTACTAATTGGATCAACGGATGAAGCAAAAGAGGCAATGAGAC 
AGGGTGTCTATGATTACCTTTGTAGTGGCGGGTTTCGTACGTCAGGGATGATGGC 
TCTGCTCGGCGGCATGAATCCTCGTCCTCTCTCTCTCGTCTATCATCTTTGTGCC 
ATCACTCTATCCTCCATTGGCCAACTGCTCTCTCCATTTCCCTCTCCCCTTCGCA 

30 TTTGGCATAGCCTCAAGCTTTTTGGTTTGGCCATGAAAATGTTGGTTCCCAATCT 
CAAAGCTGAAGGAGTTAGCCAAATGTTGTTTCCAGCAAATGCAGCCGCGTATCAC 
AAAAGCTATATGGCTGCAACCACTCTCTAAACTTTGATGCTCTCAATCGCAATAT 
ATATGGAGCACGAATCTATGTGATTGTGCATTTGGTAAACGTGTATTGCAGTGCT 
TATAATTATTAGTATGTAACGGGGAAAAGTTCTAAACACAAAAAAATAAACTTTT 

35 GAATGTTATATGTGTGAATTATTTTTGTTGTTACAAGTAATGCTCTTTTTTTTTA 
GCTTCACACATGTATTATTGGAGCTAATTTTTTGTTTCTCTGTTCTTTTATTTTT 
GTTTTCTTACTGTATTTACTTTGAAAAGTTTCGTTTTATACATATTGGACATTTT 
TTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGG 

40 GCGGCCGC 

SEQ ID 7 = Arabidopsis squalene epoxidase (clone 
ID: ATA3 04243) amino acid translation 

45 MAFTHVCLWTLVAFVLTWTVFYLTNMKKKATDLADTVAEDQKDGAADVI I VGAGV 
GGSALAYALAKDGRRVHVI ERDMRE PERMMGEFMQPGGRLMLS KLGLQDCLED I D 
AQKATGLAVYKDGKEADAPFPVDNNNFSYEPSARSFHNGRFVQQLRRKAFSLSNV 
RLEEGTVKSLLEEKGWKGVTYKNKEGEETTALAPLTWCDGCYSNLRRSLNDDN 
NAEIMSYIVGYISKNCRLEEPEKLHLILSKPSFTMVYQISSTDVRCGFEVLPENF 

50 PSIANGEMSTFMKNTIVPQVPPKLRKIFLKGIDEGAHIKWPAKRMTSTLSKKKG 
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VIVLGDAFNMRHPWASGMMVLLSDILILRRLLQPLSNLGDANKVSEVINSFYDI 
RKPM S ATVNTLGNAF SQVLIGS TDE AKE AMRQGVYD YLC S GGFRT S GMMALLGGM 
NPRPLSLVYHLCAITLSSIGQLLSPFPSPLRIWHSLKLFGLAMKMLVPNLKAEGV 
SQ MLFPANAAAYHKSYMAATTL* 

5 

SEQ ID 8 = Arabidopsis squalene epoxidase (clone 
ID: ATA102071) nucleotide sequence 

AAATCATATTGAGAACAAATAGATTTGGTTATATATGGCTTTTACGCACG 
10 TTTGTTTATGGACGTTAGTCGCCTTCGTGCTGACGTGGACGGTGTTCTACCTTAC 
CAACATGAAGAAGAAGGCGACGGATTTGGCTGATACGGTGGCTGAGGATCAAAAA 
GACGGTGCTGCTGACGTCATTATCGTCGGGGCTGGTGTAGGTGGTTCGGCTCTCG 
CATATGCTCTGCTAAGTGTGCGCCTGGAAGAAGGAACGGTGAAGTCTTTACTAGA 
AGAAAAAGGAGTGGTCAAAGGAGTGACATACAAGAATAAAGAATGCGAACAAACA 
15 ACAGCCTTGGCACCTCTCACTGTGGTATGCGACGGTTGCTAATCAAACCTTCGTC 
GGTCTCTTAATG 

SEQ ID 9 = Arabidopsis squalene epoxidase (clone 
ID: ATA102071) amino acid translation 

20 

MAFTHVCLWTLVAFVLTWTVFYLTNMKKKATDLADTVAEDQKDGAADVII 
VGAGVGGSALAYALLSVRLEEGTVKSLLEEKGWKGVTYKNKECEQTTALAPLTV 
VCDGC 

25 

SEQ ID 10 = Arabidopsis squalene epoxidase (clone 
ID: ATA504158) nucleotide sequence 

CACAAAGCAAAAAAATCTCTGTAAAAGCAGAACGATAATGGAGTCACAAT 
30 TATGGAATTGGATCTTACCTCTTTTGATCTCTTCTCTGCTCATCTCCTTCGTCGC 
TTTCTATGGATTCTTCGTCAAACCGAAGCGGAACGGTCTCCGTCACGATCGGAAA 
ACTGTTTCTACCGTCACCTCCGACGTCGGATCTGTTAATATTACCGGAGATACTG 
TCGCTGATGTCATTGTTGTTGGAGCTGGTGTTGCTGGTTCTGCTCTTGCTTATAC 
TCTTGGAAAGGGGAAATTTAAACGCCGAGTTCATGTGATTGAAAGAGATTTATCG 
35 GAGCCTGATCGTATTGTTGGGGAGTTGTTACAGCCTGNGGGTTACCTCAAGTTAC 
TGGAGTGTGGAATTGGAGATTGTGTGGAAGAAATAGATGCTCAGCNTGTGTATGG 
TTATGCACTTTTTAAAAATGGG 



SEQ ID 11 = Arabidopsis squalene epoxidase (clone 
40 ID: ATA504158) amino acid translation 

TKQKNLCKSRT IMESQLWNWI LPLL I S SLL I S FVAFYGFFVKPKRNGLRH 
DRKTVS TVT SDVG S VNI TGDTVADVI WGAGVAGS ALAYTLGKGKFKRRVHVI ER 
DLSEPDRIVGELLQPXGYLKLLECGIGDCVEEIDAQXVYGYALFKNG 



45 

SEQ ID 12 = Arabidopsis obtusifoliol C14oc- 

demethylase nucleotide sequence (Accession NO: 

complement, join AC002329 : 37461...38456 , 
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AC002329 : 39121...39546) (homolog of sorghum obtusifoliol 
C14a-demethylase) nucleotide sequence 

CGTGTTTTACA7AATTTCCTTTGTTGGTTTTCCACAGATTTAAAGAACCCT 
5 AACGAGAGAAAAAAATGGACTGGGATTACTATACGCTGTTGAAGACGAGTGTGGC 
TATTATTATAGTGTTTGTTGTGGCCAAACTCATAACCTCCTCCAAATCCAAGAAG 
AAAACAAGTGTCGTCCCACTCCCTCCAGTTCTTCAAGCGTGGCCTCCATTTATCG 
GATCCCTAATCCGCTTCATGAAAGGTCCAATAGTGCTACTTAGAGAGGAATATCC 
TAAGCTTGGAAGTGTTTTCACAGTGAAGCTTCTTCACAAAAACATCACTTTTCTC 

10 ATCGGTCCCGAAGTCTCGTCCCACTTTTTCAACGCTTATGAATCTGAACTCAGCC 
AGAAAGAAATTTACAAATTTAATGTGCCTACTTTTGGCCCCGGAGTTGTGTTTGA 
TGTTGACTATCCCGTTCGGATGGAGCAGTTCCGATTCTTCTCCAGCGCTCTCAAG 
GATTACTTCTCAAAATGGGGAGAAAGTGGGGAAGTGGATCTAAAGGC C GAGTTAG 
AGCGTCTAATCACCTTGACTGCTAGTAGATGTCTATTGGGTCGAGAAGTCCGTGA 

15 CCAACTTTTTGATGATGTTGCTCCATTGTTCCATGACCTTGATAAAGGCATGCAA 
CCCATAAGTGTCATCTTCCCAAAGCTCCCCATTCCAGCTCACAATTGTCGTGACC 
GTGCTCGCGGAAAGATTGCAAAAATCTTTTCAAACATCATAGCAACAAGAAAACG 
CTCTGGTGACAAATCAGAGAACGACATGCTACAATGTTTCATCGACTCAAAGTAC 
AAAGACGGTAGAGAGACAACTGAATCTGAAGTAACTGGTTTGCTCATTGCTGGTT 

20 TGTTTGCAGGACAACATACAAGCTCTATCACTGCCACATGGACCGGTGCTTATCT 
AATTCAAAACAAACACTGGTGGTCCGCGGCTTTGGACGAGCAGAAGAAACTGATT 
GGAAAACATGGGGACAAGATCGACTACGATGTTTTGTCTGAGATGGATTTTCTGT 
TTCGCAGTGCAAAAGAAGCTTTAAGGCTTCACCCTCCAAAGATCTTACTGCTGAG 
AACAGTACACAGTGATTTCACCGTGACAACTCGAGAAGGAAAGCAATATGAGATA 

25 CCAAAGGGTCATATCGTTGCAACTTCTCCTGCATTCGCCAACCGCTTACCTCATG 
TCTACAAAGATCCGGAAAATTTTGATCCGGATAGATTTTCAAAGGAAAGAGAAGA 
GGATAAAGCAGCTGGTTCGTGTTCATACATCTCTTTGGGAGCTGGTAGGCACGAG 
TGTCCTGGTGGATCATTTGCGTTCTTGCAGATCAAAGCCGTATGGTGTCACTTAT 
TGAGAAACTTTGAGCTTGAGTTAGTGTCACCTTTCCCTGAAATCAACTGGAATGC 

30 TTTGGTCGTTGGTGCTAAAGGAAATGTCATGGTTCGTTACAAGCGTCGTCCCTTT 
TCTTAA 



SEQ ID 13 = Arabidopsis obtusifoliol C14a- 
35 demethylase nucleotide sequence (Accession NO: 
complement, join AC002329 : 37461...38456 , 

AC002329 : 39121...39546) (homolog of sorghum obtusifoliol 
C14a-demethylase) amino acid translation 

40 MDWDYYTLLKTSVAI I IVFWAKLITSSKSKKKTSWPLPPVLQAWPPFI 

GSL I RFMKGP I VLLREEYPKLGSVFTVKLLHKMI TFL I GPEVS SHFFNAYESELS 
QKEIYKFNVPTFGPGWFDVDYPVRMEQFRFFSSALKDYFSKWGESGEVDLKAEL 
ERLITLTASRCLLGREVRDQLFDDVAPLFHDLDKGMQPISVIFPKLPIPAHNCRD 
RARGKIAKIFSNIIATRKRSGDKSENDMLQCFIDSKYKDGRETTESEVTGLLIAG 

45 LFAGQHT SSI TATWTGAYL I QNKHWWS AALDEQKKL I GKHGDKI DYDVL S EMDFL 
FRSAKEALRLHPPKILLLRTVHSDFTVTTREGKQYEIPKGHIVATSPAFANRLPH 
VYKDPENFDPDRFSKEREEDKAAGSCSYISLGAGRHECPGGSFAFLQIKAWCHL 
LRNFELELVSPFPE INWNALWGAKGNVMVRYKRRPFS * 

50 SEQ ID 14 = Arabidopsis obtusifoliol C14a- 
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demethylase (clone ID: ATA101105) nucleotide sequence 



GACACTATAGAAGAGCTATGACGTCGCATGCACGCGTACGTAAGCTCGGA 
ATTCGGCTCGAGCTTGTTCACAAAAAGATTACTTTTCTTATTGGTCCTGAAGTCT 
5 CTGCTCATTTTTTCAAAGCTTCTGAATCTGATCTTAGTCAGCAGGAAGTGTATCA 
GTTCAATGTCCCTACTTTTGGTCCTGGAGTTGTTTTCGATGTTGATTATTCTGTT 
TCGTCAGGAGCAGTTCGGTTCTTCACTGAGGCACTTAGAGTTAACAAGTTGAAGG 
GTTATGTGGATATGATGGTTACTGAAGCTGAGGATTACTTCTCTAAATGGGGAGA 
GAGTGGTGAAGTTGATATTAAGGTTGAGCTAGAGAGGCTCATCATCTTGACTGCA 
10 AGTGATGTTTACTGGGTCGAGAAGTTCGTGATCAGCTTTTTGATGATGTCTCTGC 
TTTGTTCCATGACCTTGACAATGGAATGCTTCCCATCAGTGCTTCCCATCAGTGT 
TCTCTTCCCATATCTCCCAATTCCAGCTCACCG 

SEQ ID 15 = Arabidopsis obtusifoliol C14a- 
15 demethylase (clone ID: ATA101105) amino acid translation 

HYRRAMTSHARVRKLGIRLELVHKKITFLIGPEVSAHFFKASESDLSQQE 
VYQFNVPTFGPGWFDVDYSVRQEQFGSSLRHLELTS 

20 SEQ ID 16 = Arabidopsis obtusifoliol C14a- 

demethylase (clone ID: ATA202967) nucleotide sequence 

TCGACCCCGCGTCCGCGGACGCGTGGGATCAGCTTCAAGCTTAAGAGAGC 
TTCGAAAGCGAAAGCGACGATTTCTTCTCCATCGTGAGAGCAAATCTCCAGAGCC 

25 GTTTTCTCTTCTTCTTCTTCCTCCTCGCGCCGTCTCTGAAACTCCATCATCGTAT 
CAATCAAATTGCTTCCTCCTCCAAATTGAAAAACAATGGAATTGGATTCGGAGAA 
CAAAT TGTTGAAGACGGGTTTGGTTATAGT GGCGACAC TTGT T ATAGC C AAACT C 
ATCTTCTCTTTCTTCACTTCTGATTCTAAGAAGAAGCGTCTTCCTCCTACTCTTA 
AAGCTTGGCCTCCATTGGTTGGAAGTCTTATCAAATTCTTGAAAGGACCTATTAT 

30 TATGCTTAGAGAGGAATACCCTAAGCTTGGAAGTGTGTTTACTGTTAATCTTGTT 
CACAAAAAGATTACTTTTCTTATTGGTCCTGAAGTCTCTGCTCATTTTTTCAAAG 
CTTCTGAATCTGATCTTAGTCAGCAGGAAGTGTATCAGTTCAATGTCCCTACTTT 
TGGTCCTGGAGTTGTTTTCGATGTTGATTATTCTGTTCGTCAGGAGCAGTTTCGG 
TTCTTCACTGAGGCACTTAGAGTTAACAAGTTGAAGGGTTATGTGGATATGATGG 

35 TTACTGAAGCTGAGGATTACTTCTCTAAATGGGGAGAGAGTGGTGAAGTTGATAT 
TAAGGTTGAGCTAGAGAGGCTCATCATCTTGACTGCAAGTAGATGTTTACTGGGT 
CGAGAAGTTCGTGATCAGCTTTTTGATGATGTCTCTGCTTTGTTCCATGACCTTG 
ACAATGGAATGCTTCCCATCAGTGTTCTCTTCCCATATCTCCCAATTCCAGCTCA 
CCGCCGTCGTGACCGTGCCCGAGAAAAGCTTTCGGAGATTTTCGCAAAAATCATT 

40 GGGTCGAGAAAACGCTCTGGAAAAACAGAGAACGACATGCTGCAGTGTTTCATCG 
AATCAAAGTACAAAGATGGTAGACAGACAACCGAATCTGAAGTCACTGGTTTGCT 
CATTGCTGCTCTGTtTGCAGGACAACACACGAGCTCTATCACTTCCACCTGGACC 
GGTGCTTATCTGATGCGATACAAAGAGTACTTCTCAGCTGCTCTTGATGAGCAGA 
AGAACCTGATTGCGAAACATGGAGACAAGATCGATCATGATATCTTATCCGAGAT 

45 GGATGTTCTCTACCGCTGCATTAAGGAAGCGTTGAGGCTTCACCCTCCACTCATC 
ATGTTAATGAGAGCCTCGCACAGTGATTTCAGCGTGACAGCTCGGGATGGAAAAA 
CTTACGATATCCCAAAGGGTCACATCGTTGCAACCTCCCCTGCATTTGCCAACCG 
CTTACCGCACATCTTCAAAGACCCCGACACCTACGACCCAGAAAGATTCTCCCCT 
GGAAGAGAAGAGGACAAAGCCGCAGGGGCATTCTCGTACATTGCATTCGGAGGGG 

50 GAAGGCACGGGTGCCTTGGAGAGCCGTTTGCTTACCTGCAGATCAAAGCCATATG 
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GAGTCATTTGTTGAGGAACTTCGAGCTTGAGCTAGTTTCACCGTTCCCTGAGATT 
GACTGGAACGCTATGGTGGTTGGAGTTAAAGGCAATGTGATGGTGCGTTACAAGA 
GGCgcCAGCTTTCTTAAAGACAAGTTTAAGGTTATTGCAGCTTTGGATTTTTCTC 
TCTGGTTTCTGCTTTGCTTTTGTCCCTCTCTGGTTTTAGTTTTGTTGTTGAATAA 
TTCTTCTGTTTTTATAAACTGTTGTTACTCTTTAATTGACATTTATTTTTAAGCT 
TCCTAAGTTTGTGGTTCAAAAAAAAAAAAAGGCGGCGTTACT 

SEQ ID 17 = AraJoidopsis obtusifoliol C14cc- 
demethylase (clone ID: ATA202967) amino acid translation 

MELDSENKLLKTGLVIVATLVIAKLIFSFFTSDSKKKRLPPTLKAWPPLVGSLIK 
FLKGPI IMLREEYPKLGSVFTVNLVHKKITFLIGPEVSAHFFKASESDLSQQEVY 
QFNVPTFGPGWFDVDYSVRQEQFRFFTEALRVNKLKGYVDMMVTEAEDYFSKWG 

ESGEVD I KVELERL IILTAS RCLLGREVRDQLFDDVS ALFHDLDNGMLP I S VLF P 
YLPIPAHRRRDRAREKLSEIFAKIIGSRKRSGKTENDMLQCFIESKYKDGRQTTE 
S EVTGLL I AALFAGQHT S S I T S TWTGAYLMRYKE YF SAALDEQKNL I AKHGDKI D 
HD I L S EMDVL YRC I KE ALRLHP P L I MLMRASHSDF S VTARDGKT YD I P KGH I VAT 
SPAFANRLPHIFKDPDTYDPERFSPGREEDKAAGAFSYIAFGGGRHGCLGEPFAY 

LQI KAI WSHLLRNFELELVS PFPE I DWNAMWGVKGNVMVRYKRRQL S * 

SEQ ID 18 = Arabidopsis obtusifoliol C14oc- 
demethylase (clone ID: ATA403931) nucleotide sequence 

TCGACCCCGCGTCCGCGGACGCGTGGGATCAGCTTCAAGCTTAAGAGAGC 
TTCGAAAGCGAAAGCGACGATTTCTTCTCCATCGTGAGAGCAAATCTCCAGAGCC 
GTTTTCTCTTCTTCTTCTTCCTCCTCGCGCCGTCTCTGAAACTCCATCATCGTAT 
CAATCAAATTGCTTCCTCCTCCAAATTGAAAAACAATGGAATTGGATTCGGAGAA 
CAAATTGTTGAAGACGGGTTTGGTTATAGTGGCGACACTTGTTATAGCCAAACTC 
ATCTTCTCTTTCTTCACTTCTGATTCTAAGAAGAAGCGTCTTCCTCCTACTCTTA 
AAGCTTGGCCTCCATTGGTTGGAAGTCTTATCAAATTCTTGAAAGGACCTATTAT 
TATGCTTAGAGAGGAATACCCTAAGCTTGGAAGTGTGTTTACTGTTAATCTTGTT 
CACAAAAAGATTACTTTTCTTATTGGTCCTGAAGTCTCTGCTCATTTTTTCAAAG 
CTTCTGAATCTGATCTTAGTCAGCAGGAAGTGTATCAGTTCAATGTCCCTACTTT 
TGGTCCTGGAGTTGTTTTCGATGTTGATTATTCTGTTCGTCAGGAGCAGTTTCGG 
TTCTTCACTGAGGCACTTAGAGTTAACAAGTTGAAGGGTTATGTGGATATGATGG 
TTACTGAAGCTGAGGATTACTTCTCTAAATGGGGAGAGAGTGGTGAAGTTGATAT 
TAAGGTTGAGCTAGAGAGGCTCATCATCTTGACTGCAAGTAGATGTTTACTGGGT 
CGAGAAGTTCGTGATCAGCTTTTTGATGATGTCTCTGCTTTGTTCCATGACCTTG 
ACAATGGAATGCTTCCCATCAGTGTTCTCTTCCCATATCTCCCAATTCCAGCTCA 
CCGCCGTCGTGACCGTGCCCGAGAAAAGCTTTCGGAGATTTTCGCAAAAATCATT 
GGGTCGAGAAAACGCTCTGGAAAAACAGAGAACGACATGCTGCAGTGTTTCATCG 
AATCAAAGTACAAAGATGGTAGACAGACAACCGAATCTGAAGTCACTGGTTTGCT 
CATTGCTGCTCTGTtTGCAGGACAACACACGAGCTCTATCACTTCCACCTGGACC 
GGTGCTTATCTGATGCGATACAAAGAGTACTTCTCAGCTGCTCTTGATGAGCAGA 
AGAACCTGATTGCGAAACATGGAGACAAGATCGATCATGATATCTTATCCGAGAT 
GGATGTTCTCTACCGCTGCATTAAGGAAGCGTTGAGGCTTCACCCTCCACTCATC 
ATGTTAATGAGAGCCTCGCACAGTGATTTCAGCGTGACAGCTCGGGATGGAAAAA 
CTTACGATATCCCAAAGGGTCACATCGTTGCAACCTCCCCTGCATTTGCCAACCG 
CTTACCGCACATCTTCAAAGACCCCGACACCTACGACCCAGAAAGATTCTCCCCT 
GGAAGAGAAGAGGACAAAGCCGCAGGGGCATTCTCGTACATTGCATTCGGAGGGG 
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GAAGGCACGGGTGCCTTGGAGAGCCGTTTGCTTACCTGCAGATCAAAGCCATATG 
GAGTCATTTGTTGAGGAACTTCGAGCTTGAGCTAGTTTCACCGTTCCCTGAGATT 
GACTGGAACGCTATGGTGGTTGGAGTTAAAGGCAATGTGATGGTGCGTTACAAGA 
GGCgcCAGCTTTCTTAAAGACAAGTTTAAGGTTATTGCAGCTTTGGATTTTTCTC 
5 TCTGGTTTCTGCTTTGCTTTTGTCCCTCTCTGGTTTTAGTTTTGTTGTTGAATAA 
TTCTTCTGTTTTTATAAACTGTTGTTACTCTTTAATTGACATTTATTTTTAAGCT 
TCCTAAGTTTGTGGTTCAAAAAAAAAAAAAGGCGGCGTTACT 

SEQ ID 19 = Arabidopsis obtusif oliol C14a- 
10 demethylase (clone ID: ATA403931) amino acid translation 

MELDSENKLLKTGLVIVATLVIAKLIFSFFTSDSKKKRLPPTLKAWPPLVGSLIK 
FLKGPIIMLREEYPKLGSVFTWLVHKKITFLIGPEVSAHFFKASESDLSQQEVY 
QFNVPTFGPGWFDVDYSVRQEQFRFFTEALRVNKLKGYVDMMVTEAEDYFSKWG 

15 ESGEVDIKVELERLI ILTASRCLLGREVRDQLFDDVSALFHDLDNGMLPI SVLFP 
YLP I P AHRRRDRAREKL S E I FAKI I GSRKRSGKTENDMLQCF I E S KYKDGRQTTE 
SEVTGLLIAALFAGQHTSSITSTWTGAYLMRYKEYFSAALDEQKNLIAKHGDKID 
HD I L S EMDVL YRC I KEALRLHP P L I MLMRAS HS D F S VTARDGKT YD I PKGH I VAT 
SPAFANRLPHIFKDPDTYDPERFSPGREEDKAAGAFSYIAFGGGRHGCLGEPFAY 

20 LQIKAIWSHLLRNFELELVSPFPEIDWNAMWGVKGNVMVRYKRRQLS* 

SEQ ID 20 = Arabidopsis sterol methyl transferase 

I protein sequence (Accession NO: U71400) See Appendix 
D 

25 

SEQ ID 21 = Tobacco sterol methyl transferase I 
protein sequence (from Prof. Pierre Benveniste 
Accession NO: U81312) See Appendix E 

30 SEQ ID 22 = Arabidopsis sterol methyl transferase 

II protein sequence (Accession NO: X89867) (from Prof. 
Pierre Benveniste) See Appendix F 

SEQ ID 23 = Arabidopsis sterol C5 -desaturase 
35 protein sequence (Accession NO: X90454) See Appendix G 

SEQ ID 24 = Rubber truncated HMGRlml (S566 to A) 
nucleotide sequence 

ATGGCACGCGCCTCCCATGACGTGTGGGACCTCGAAGATACGGATCCCAACTACC 
40 TCATCGATGAAGATCACCGTCTCGTTACTTGCCCTCCCGCTAATATATCTACTAA 
GACTACCATTATTGCCGCACCTACCAAATTGCCTACCTCGGAACCCTTAATTGCA 
CCCTTAGTCTCGGAGGAAGACGAAATGATCGTCAACTCCGTCGTGGATGGGAAGA 
TACCCTCCTATTCTCTGGAGTCGAAGCTCGGGGACTGCAAACGAGCGGCTGCGAT 
TCGACGCGAGGCTTTGCAGAGGATGACAAGGAGGTCGCTGGAAGGCTTGCCAGTA 
45 GAAGGGTTCGATTACGAGTCGATTTTAGGACAATGCTGTGAAATGCCAGTGGGAT 
ACGTGCAGATTCCGGTGGGGATTGCGGGGCCGTTGTTGCTGAACGGGCGGGAGTA 
CTCTGTTCCAATGGCGACCACGGAGGGTTGTTTGGTGGCGAGCACTAATAGAGGG 
TGTAAGGCGATTTACTTGTCAGGTGGGGCCACCAGCGTCTTGTTGAAGGATGGCA 
TGACAAGAGCGCCTGTTGTAAGATTCGCGTCGGCGACTAGAGCCGCGGAGTTGAA 
50 GTTCTTCTTGGAGGATCCTGACAATTTTGATACCTTGGCCGTAGTTTTTAACAAG 
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TCCAGTAGATTTGCGAGGCTCCAAGGCATTAAATGCTCAATTGCTGGTAAGAATC 
TTTATATAAGATTCAGCTGCAGCACTGGCGATGCAATGGGGATGAACATGGTTTC 
TAAAGGGGTTCAAAACGTTCTTGAATTTCTTCAAAGTGATTTTTCTGATATGGAT 
GTCATTGGAATCTCAGGAAATTTTTGTTCGGATAAGAAGCCTGCTGCTGTAAATT 
5 GGATTGAAGGACGTGGCAAATCAGTTGTTTGTGAGGCAATTATCAAGGAAGAGGT 
GGTGAAGAAGGTGTTGAAAACCAATGTGGCCTCCCTAGTGGAGCTTAACATGCTC 
AAGAATCTTGCTGGTTCTGCTGTTGCTGGTGCTTTGGGTGGATTTAATGCCCATG 
CAGGCAACATCGTATCTGCAATCTTTATTGCCACTGGCCAGGATCCAGCACAGAA 
TGTTGAGAGTTCTCATTGCATTACCATGATGGAAGCTGTCAATGATGGAAAGGAT 

10 CTCCATATCTCTGTGACCATGCCCTCCATTGAGGTGGGTACAGTCGGAGGTGGAA 
CTCAACTTGCATCTCAGTCTGCTTGTCTCAATTTGCTTGGGGTGAAGGGTGCAAA 
CAAAGAGTCGCCAGGATCAAACTCAAGGCTCCTTGCTGCCATCGTAGCTGGTTCA 
GTTTTGGCTGGTGAGCTCTCCTTGATGTCTGCCATTGCAGCTGGGCAGCTTGTCA 
AGAGTCACATGAAGTACAACAGAGCCAGCAAAGATATGTCTAAAGCTGCATCTTA 

15 G 

SEQ ID 25 = Rubber truncated HMGRlml (S566 to A) 
amino acid translation 

MARAS HDVWDLEDTD PNYL I D EDHRLVTC P PAN I S TKTT I IAAPTKLPTSEPLI A 
20 PLVSEEDEMIVNSWDGKIPSYSLESKLGDCKRAAAIRREALQRMTRRSLEGLPV 
EGFDYESILGQCCEMPVGYVQIPVGIAGPLLLNGREYSVPMATTEGCLVASTNRG 
CKAIYLSGGATSVLLKDGMTRAPWRFASATRAAELKFFLEDPDNFDTLAWFNK 
SSRFARLQGIKCSIAGKNLYIRFSCSTGDAMGMNMVSKGVQNVLEFLQSDFSDMD 
VIGISGNFCSDKKPAAVNWIEGRGKSWCEAIIKEEWKKVLKTNVASLVELNML 
25 KMLAGSAVAGALGGFNAHAGNI VSAI F I ATGQDPAQNVES SHC ITMMEAVNDGKD 
LHI SVTMPS IEVGTVGGGTQLASQSACLNLLGVKGANKES PGSNSRLLAAI VAGS 
VLAGELSLMSAIAAGQLVKSHMKYNRASKDMSKAAS 



30 

SEQ ID. 2 6 = Rubber truncated HMGRlm2 (S567 to A) 
nucleotide sequence 

ATGGCACGCGCCTCCCATGACGTGTGGGACCTCGAAGATACGGATCCCAACTACC 
TCATCGATGAAGATCACCGTCTCGTTACTTGCCCTCCCGCTAATATATCTACTAA 

35 GACTACCATTATTGCCGCACCTACCAAATTGCCTACCTCGGAACCCTTAATTGCA 
CCCTTAGTCTCGGAGGAAGACGAAATGATCGTCAACTCCGTCGTGGATGGGAAGA 
TACCCTCCTATTCTCTGGAGTCGAAGCTCGGGGACTGCAAACGAGCGGCTGCGAT 
TCGACGCGAGGCTTTGCAGAGGATGACAAGGAGGTCGCTGGAAGGCTTGCCAGTA 
GAAGGGTTCGATTACGAGTCGATTTTAGGACAATGCTGTGAAATGCCAGTGGGAT 

40 ACGTGCAGATTCCGGTGGGGAT'TGCGGGGCCGTTGTTGCTGAACGGGCGGGAGTA 
CTCTGTTCCAATGGCGACCACGGAGGGTTGTTTGGTGGCGAGCACTAATAGAGGG 
TGTAAGGCGATTTACTTGTCAGGTGGGGCCACCAGCGTCTTGTTGAAGGATGGCA 
TGACAAGAGCGCCTGTTGTAAGATTCGCGTCGGCGACTAGAGCCGCGGAGTTGAA 
GTTCTTCTTGGAGGATCCTGACAATTTTGATACCTTGGCCGTAGTTTTTAACAAG 

45 TCCAGTAGATTTGCGAGGCTCCAAGGCATTAAATGCTCAATTGCTGGTAAGAATC 
TTTATATAAGATTCAGCTGCAGCACTGGCGATGCAATGGGGATGAACATGGTTTC 
TAAAGGGGTTCAAAACGTTCTTGAATTTCTTCAAAGTGATTTTTCTGATATGGAT 
GTCATTGGAATCTCAGGAAATTTTTGTTCGGATAAGAAGCCTGCTGCTGTAAATT 
GGATTGAAGGACGTGGCAAATCAGTTGTTTGTGAGGCAATTATCAAGGAAGAGGT 

50 GGTGAAGAAGGTGTTGAAAACCAATGTGGCCTCCCTAGTGGAGCTTAACATGCTC 
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AAGAATCTTGCTGGTTCTGCTGTTGCTGGTGCTTTGGGTGGATTTAATGCCCATG 
CAGGCAACATCGTATCTGCAATCTTTATTGCCACTGGCCAGGATCCAGCACAGAA 
TGTTGAGAGTTCTCATTGCATTACCATGATGGAAGCTGTCAATGATGGAAAGGAT 
CTCCATATCTCTGTGACCATGCCCTCCATTGAGGTGGGTACAGTCGGAGGTGGAA 
5 CTCAACTTGCATCTCAGTCTGCTTGTCTCAATTTGCTTGGGGTGAAGGGTGCAAA 
CAAAGAGTCGCCAGGATCAAACTCAAGGCTCCTTGCTGCCATCGTAGCTGGTTCA 
GTTTTGGCTGGTGAGCTCTCCTTGATGTCTGCCATTGCAGCTGGGCAGCTTGTCA 
AGAGTCACATGAAGTACAACAGATCCGCCAAAGATATGTCTAAAGCTGCATCTTA 
G 

10 

SEQ ID 27 = Rubber truncated HMGRlm2 (S567 to A) 
amino acid translation 

MARASHDVWDLEDTDPNYL I DEDHRL VT C P PAN I S TKTT I IAAPTKLPTSEPLIA 
15 PLVSEEDEMIVNSWDGKIPSYSLESKLGDCKRAAAIRREALQRMTRRSLEGLPV 
EGFDYESILGQCCEMPVGYVQIPVGIAGPLLLNGREYSVPMATTEGCLVASTNRG 
CKAIYLSGGATSVLLKDGMTRAPWRFASATRAAELKFFLEDPDNFDTLAWFNK 
S SRFARLQG I KCS I AGKNLY I RF S C S TGDAMGMNMVS KGVQNVLE FLQ SD F SDMD 
VIGI SGNFCSDKKPAAVNWI EGRGKS WCEAI I KEEWKKVLKTNVASLVELNML 
20 KNLAGSAVAGALGGFNAHAGNI VS AI F IATGQD PAQNVE S SHC I TMMEA VNDGKD 
LHISVTMPSIEVGTVGGGTQLASQSACLNLLGVKGANKESPGSNSRLLAAIVAGS 
VLAGEL S LMS AI AAGQL VKS HMKYNRS AKDM S KAAS 

Example 6 . Arabidopsis obtusifoliol C14oc- 
25 demethylase constructs 

The Arabidopsis obtusifoliol C14oc-demethylase gene 
was amplified from two separate Arabidopsis mRNA 
samples (SIN 2 and Keto-10) through use of primers 
BXK33 and BXK34, as described below. 

30 

BXK3 3 (SEQ ID 28): 5 ' -GAGATCTGAACCCTAACGAGAG-3 ' 
BXK34 (SEQ ID 29): 5 ' - GGAGCTCTTAAGAAAAGGGACGACGC - 3 ' 

35 The primer BXK3 3 has a Bgl II cleavage site shown 

in bold. The primer BXK34 has a Sac I cleavage site 
shown in bold. The actual size of the structural gene 
is 1.445 Kb. 

The Arabidopsis mRNA was amplified using a Perkin 
40 Elmer GeneAmp RT-PCR kit. The reverse transcription 
reaction used 2 5 mM MgCl 2 (4 jj.1 ; 5 mM final) , 10X PCR 
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buffer (2 ju.1 ) , di DEPC water (1 p.1) , 2 |ul each of 1 mM 
solution of each of four dNTPs (dGTP, dATP, dUTP, 
dCTP) , RNase inhibitor (1 of 10 units per (4.1 stock) , 
MMLV reverse transcriptase (1 fj,l of a 2.5 U/p.1 stock), 

5 Oligo d(T)16 Primer (1 p,l of a 2.5 (iM stock), and 2 \il 
of an Arabidopsis polyA RNA sample. The reaction mix 
was incubated at room temperature (about 2 0°C) for 10 
minutes, then in a PCR machine for one cycle (15 min. 
at 42°C, 5 min. at 99°C and 5 min. at 4°C) . 

10 Separate primer-mediated amplification reactions 

were carried out using Taq DNA polymerase and Vent DNA 
polymerase to obtain Arabidopsis obtusifoliol C14a- 
demethylase cDNA from the amplified mRNA sample. 



Taq PCR Reaction 


Vent PCR Reaction 


4 pi 25 mM MgCl 2 


4 ul 2 5 mM MgCl 2 


8 ul 10X PCR buffer II 


8 ul 10X Vent PCR buffer 


65.5 ul di DEPC water 


65.5 ul di DEPC water 


0.5 ul AmpliTaq polymerase 


0.5 ul Vent polymerase 



15 

After 1 minute and 35 seconds at 95°C, 1 ul each of 
15 uM stocks of the upstream and downstream primers 
(BXK3 3 and BXK3 4) were added to the PCR reaction (10 0 
ul total PCR reaction volume) and the PCR reaction 

20 solutions were subjected to 35 cycles (95°C for 15 

seconds, then 60°C for 3 0 seconds) . The amplified PCR 
reaction was then maintained at 72°C for 7 minutes and 
then stored at 4°C. An amplification positive control 
reaction was carried out under the same conditions with 

25 DM151 and DM152 primers. 



DM151 (SEQ ID 30): 5 ' -GTCTCTGAATCAGAAATCCTTCTATC-3 ' 
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DM152 (SEQ ID 31): 5 ' -CATGTCAAATTTCACTGCTTCATCC-3 ' 



Electrophoresis of the nucleic acid solutions 
after PCR amplification displayed an amplification 
product corresponding approximately to the size of the 
desired 1.445 Kb structural gene. The fragment was 
cloned into an M13 vector. A representative sequencing 
reaction consisted of: 10 ul of plasmid DNA (200-500 
ng) , 2 |il of M13 Forward or Reverse primer (15 
picomoles) and 8 ^1 of Big Dye Terminator Reaction Mix 
(PE Applied Biosci.) . The clone copy of ATA101105 was 
called CPR17398. The sequence of the selected clone 
{Arabidopsis obtusifoliol C14a-demethylase) is 
identified as SEQ ID NO : 9 . 

The predicted polypeptide sequence for the cloned 
Arabidopsis obtusifoliol C14a-demethylase sequence was 
subjected to a BLAST search in the public database and 
found to align with the sorghum obtusifoliol 14 -alpha 
demethylase polypeptide (ATA101105/U74319/gl658192 ; and 
gl216657/U74319) exhibiting 75-78% sequence identity 
and 87-90% sequence homology. The cloned nucleic acid 
encoding Arabidopsis obtusifoliol C14cc-demethylase (SEQ 
ID No: 9) is missing the 5' end. 

The 5' terminal portion of the structural gene was 
obtained by the RACE (Rapid Amplification of cDNA Ends) 
PCR using primers BXK39 and BXK40 per manufacturer's 
instructions (Clontech) . 

BXK39 (SEQ ID 32): 5 ' -GAGATCTCCACAGATTTAAAGAACCCTAACG-3 
BXK4 0 (SEQ ID 33): 5 ' -GGAGCTCGGTTTTTAAGAAAAGGGACGACGC-3 

The cloned nucleic acid encoding full length 
Arabidopsis obtusifoliol C14a-demethylase is identified 
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as SEQ ID No: 8. The amplified Arabidopsis obtusif oliol 
C14a-demethylase structural gene is useful for making 
constructs that express Arabidopsis obtusifoliol C14oc- 
demethylase in transgenic plants. 
5 In light of the detailed description of the 

invention and the examples presented above, it can be 
appreciated that the several aspects of the invention 
are achieved. 

It is to be understood that the present invention 

10 has been described in detail by way of illustration and 
example in order to acquaint others skilled in the art 
with the invention, its principles, and its practical 
application. Particular formulations and processes of 
the present invention are not limited to the 

15 descriptions of the specific embodiments presented, but 
rather the descriptions and examples should be viewed 
in terms of the claims that follow and their 
equivalents. While some of the examples and 
descriptions above include some conclusions about the 

20 way the invention may function, the inventor does not 
intend to be bound by those conclusions and functions, 
but puts them forth only as possible explanations. 

It is to be further understood that the specific 
embodiments of the present invention as set forth are 

25 not intended as being exhaustive or limiting of the 
invention, and that many alternatives, modifications, 
and variations will be apparent to those of ordinary 
skill in the art in light of the foregoing examples and 
detailed description. Accordingly, this invention is 

30 intended to embrace all such alternatives, 

modifications, and variations that fall within the 
spirit and scope of the following claims. 



